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Abstract 



B 

*-rt ■ Our object is to formulate and analyze a physically plausible and 

C I mathematically sound model to better understand the phenomenon 

of clustering in colloids. The term depletion force refers to the force 
(in the Newtonian sense) which is associated with the clustering. Our 
model is stochastic but derived from a deterministic setup in a New- 
£> ■ tonian setting. A mathematical transition from the deterministic dy- 

namics of several large particles and infinitely many small particles 
f ^ | to a kinetic description of the stochastic motion of the large particles 

I/"") ' is available. Assuming that the empirical velocity distribution of the 

[~^. ■ small particles is governed by a probability density, the mean-field 

force on the large particles can be represented as the negative gra- 
dient of a scaled version of that density. The stochastic motion of 
the large particles can then be described by a system of correlated 
Brownian motions. The scaling in the transition preserves a small pa- 
rS ' rameter, the correlation length. From the limiting kinetic stochastic 

C3 ■ equations we compute the probability flux rates for the difference in 
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position between two large particles. We show that, for short times, 
two particles sufficiently close together tend to be attracted to each 
other. This agrees with the depletion phenomena observed in colloids. 
To quantify this effect, we extend the notion of van Kampen's one- 
dimensional probability flux rate in an appropriate way to account for 
higher dimensional effects. 
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1 INTRODUCTION 5 

1 Introduction 

A kinetic model for Brownian motion was introduced by A. Einstein [l(J and 



by M. von Smoluchowski |J g]; a corresponding dynamic model was analyzed 



by Uhlenbeck and Ornstein | 4|. The mathematical model, first formulated 



for a single large particle suspended in a stationary liquid, can be generalized 
easily to models of several large particles under the assumption that their 
positions, as well as their velocities, are spatially independent. The require- 
ment of independence is a central assumption in the theory of stochastic 
differential equations, mathematical particle systems and their macroscopic 
limits. However, this assumption appears to be in conflict with the fact that 
the "suspended particles all float in the same fluid. "Q This begs the ques- 
tion of how to incorporate correlations for physically relevant models. We 
address this question by considering systems of two types of interacting par- 
ticles -- large particles and small particles; the terms large and small here 
refer to their different masses. After a suitable transition from micro-scales 
to meso-scales, the positions, as well as the velocities, of the large particles 
will perform Brownian motions. 

If there were only a single large particle, the fluid around it would ap- 
pear homogeneous and isotropic, which leads to a relatively simple statistical 
description of the displacement of that particle as a result of the collisions 
with the small particles. Under the assumption that the collisions of the 
large particle with the small particles are elastic, a Brownian motion as an 
approximation to the position and velocity of the particle has been obtained 
by several authors (See the references in Kotelenez | 7||). Under the same as- 



sumptions, if there are two large particles, sufficiently far apart that the fluid 
around each may be considered homogeneous and isotropic, each would be 
expected to experience a Brownian motion as an approximation to its posi- 
tion and velocity; moreover, these Brownian motions would be independent. 
(See Figure ([[], A) below.) 

On the other hand, if the two large particles are very close together, the 
fluid around each will no longer appear homogeneous and isotropic. (See 
Figure ([[], B) below.) In fact, the fluid between the two large particles will 
get depleted in the sense that fewer small particles per unit volume will be 
found between them (See Asakura, S and Oosawa, F. H, Goetzelmann, B., 



Evans, R. and Dietrich, S. fTTH and the references therein.) Thus, if the two 



1 Spohn Q, Part II, Section 7.2. Emphasis added. 
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1 INTRODUCTION 6 

large particles are sufficiently close together, the collisions of the many small 
particles with them cause their motions to become statistically correlated. 
The length scale at which this occurs is measured by the correlation length, 
which we denote by \fe. 

To capture this effect faithfully, we analyze a model in which the elas- 
tic collisions between large and small particles are replaced by a mean-field 
interaction. The mean-field force is derived from the probability density of 
the velocity field of the small particles. The scaling preserves the correla- 
tion length. The positions and velocities of N, N > 1, large particles and 
infinitely many small particles are given by a deterministic system of cou- 
pled nonlinear equations with independent random initial conditions. The 
equations are coupled through the rescaled mean-field force. Using coarse 



graining in space and time, Kotelenez | J obtained N correlated Brownian 
motions in a scaling limit as an approximation to the positions of the N large 
particles, where the spatial correlations are computed from the probability 
density of the velocity field of the small particles. We consider here the cor- 
related limiting diffusions, outlining the main features of our analysis but 
focusing mainly on the interaction between just two large particles (N = 2). 
The rest of this paper is organized as follows. In Section we first de- 



scribe the excluded volume model of Vrij and de Hek |18| , which is equiv- 
alent to the model of Asakura and Oosawa ||. We continue with a brief 
discussion of three experimental studies of the depletion force, each more 
stochastically oriented than its predecessor. The final, and most stochasti- 
cally oriented, gives results that agree nicely with the hard sphere model of 



Vrij and de Hek [46|, [18|] . In Section ||] we outline the underlying interacting 



particle model and the stochastic limit referred to above. The notation in 
Section [2| is consistent with the literature referred to in that section, but gen- 
erally differs from that which we use in the rest of this paper. In Section f| we 
set out the notational conventions used in the other sections. In this section 
we also provide a statement of the fundamental Principle of Material Frame 
Indifference together with a useful lemma. In Section [^ our stochastic model 
is formulated in detail for the case of two large particles. This section culmi- 
nates with a derivation and description of the diffusion matrix that governs 
the stochastic process for the separation between the two large particles. In 
Section |6], we continue with an extensive discussion of this stochastic process. 
The depletion effect is revealed and quantified in Section |7| as a consequence 
of our model through our d— dimensional version of van Kampen's probability 
flux rate. Finally, in Section || we summarize our results and lay out avenues 
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2 INTERACTING PARTICLES — DEPLETION EFFECT 7 

of further research. 

2 Interacting Particles — Depletion Effect 

Consider a mixture of small, spherical particles of diameter a s and big, spheri- 
cal particles of diameter a& so that a s <C aj,. Assume that the number fractions 
obey, Xb <C x a \ the reference state is the very dilute solution with respect 
to the big particles. In this state, the probable separation between any two 
big particles is at least 10a s .This separation is sufficient to ensure that the 
distribution of small particles around the each big particle is uniform with 
spherical symmetry. The configuration is shown in Figure (fl], A). 




Figure 1: 

A. Two large particles are separated sufficiently so that the packing around 
each particle is isotropic on average. 

B. If the two large particles are closer than about 2a s the distribution of 
small particles is depleted in the region between them. 



When two big particles move together the configuration space of the small 
particles changes when the separation is less than about 2a s as represented 
in Figure ([I], B). The reduction of small particle density in the space between 
large particles that are sufficiently close is referred to as depletion. 

Experiments early in the last century showed that colloidal systems com- 
posed of dispersions of particles and macromolecules show interesting floc- 
culation or agglomeration phenomena, which was not understood until the 
paper of Asakura and Oosawa [0. They describe the case of parallel plates 
immersed in a solution of hard, macromolecular particles. Then they as- 
sert that as the two plates are brought close together an attractive osmotic 
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force develops as a result of depletion of the macromolecules from the volume 
between the two plates. 

Vrij and de Hek ]46], [L8| independently rediscovered the depletion effect 
and carried out experimental and theoretical studies that involved coated 
silica spheres, the large particles, of about 50 nm radius in dispersions con- 
taining polystyrene, the small particles, mixed in cyclohexane. The marker 
for the depletion effect is the phase separation, which was observed with the 
variation of polystyrene concentration. 



2.1 The excluded volume model of Vrij 



The hard sphere model of Vrij ]46| and Vrij and de Hek |18| is equivalent 
to that of Asakura and Oosawa [|TJ and is shown in Figure @j. In many 
experiments the small sphere is a coiled polymer, which is approximated 
by assuming the polymer component behaves as a small hard-sphere when 
interacting with a large hard-sphere but penetrable when interacting with 
another small "hard-sphere." This is the penetrable hard sphere (PHS) model, 
shown in Figure (0). 



a 



q = i/2(a b + a s ) 




1/2 r 



Figure 2: The Vrij model [|T8] , ||46|| , while crude, provides physical insight 



and fits dilute systems, see Figure 



Briefly, two large particles of diameter a& are moving toward collision and 
assume that the large particles are hard so that the force is unbounded when 
r < a^. The number density of the small particles, of diameter a s , is assumed 
sufficiently dilute that the osmotic pressure, P os , is ideal, P os = c s k B T. The 
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excluded volume produced by the hard sphere interaction between the small 
and large particles is traced by the dashed curve of radius a = \{a b + a s ) 
in Figure fj. Notice that the normal, n, points into the surrounding phase 
whereas the osmotic force, / = —P os n, points inward. At a particular sepa- 
ration, r, the two excluded volumes overlap and c s is diminished there. The 
osmotic force, projected along the center to center axis, n , is attractive, 



formula (|2.1| , |2!J). Following Vrij, the depletion force, K is estimated as 

K= [ f-nodA. (2.1) 



Here, / ■ n = —P os cos 9 and from Figure (0) cos6> = \rja so that following 
Vrij 

{oo for r < a& 
-7ia 2 (l-\(^ 2 )c s k B T fora b <r<2a . (2.2) 
for r > 2a 

The potential is 

p2a 

V{T,a,c s ,r) = K(T,a,c s ,f) dr 

J r 

oo for r < a& 



for r > 2a 



-|7ra 3 (l - |(i) + ^(^) 3 )c s ^T for a b < r < 2a 



(2.3) 



Vrij and his collaborators certainly recognized that this model is crude, how- 
ever, it suggested experiments that sharpen the physical chemistry of the 
depletion effect. The hard sphere physics can be approximated by dressing 
small particles in layers that reduce the Van der Waals attractive force and 
the coulombic interactions that obtain in many colloid systems. See, for 
example, de Hek and Vrij JO 



The papers by Henderson et al. \ 0], Gotzelmann, et al. \ 7], Zhou |< 



Biben, et al. H, and Kinoshita J \ represent the literature wherein a funda- 
mental approach is developed to investigated a multicomponent hard-sphere 
dispersion. Also, see the review by Tuinier, et al. [[41]. 

We did not find any articles that addressed the effect of depletion on the 
correlated, stochastic motion of hard-sphere, multicomponent dispersions. 
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2 INTERACTING PARTICLES — DEPLETION EFFECT 10 

However, there are experimental techniques and results that are pertinent to 
the theory developed herein. 

2.2 Some Experimental Studies of Depletion Forces 

As developed in the early work of Vrij and de Hek Q, [il| and reviewed by 
Tuinier, et al. ||41| , the details of the phase behavior of polymer, colloid mix- 



tures yield information about depletion effects. For example, silica particles 
coated with stearyl alcohol dispersed in cyclohexane will provide close to in- 
dex matching conditions so that the Van der Waals force between the coated 
particles is ignorably small. Moreover, the effect of any electrical double layer 
is negligible. Such a system is very close to a dispersion of hard spheres. The 
interaction of two large particles allows the determination of the depletion 
effect induced by the number density of the polymer component. 

2.2.1 Scattering techniques and colloid properties that show de- 
pletion effects 

Static and dynamic light scattering measurements are capable of measuring 



depletion effects as outlined by Tuinier et al. J | who describe a small an- 
gle neutron scattering (SANS) study of casein micelles and an exocellular 
polysaccharide system that behaves as a PHS system. They also report dy- 
namic light scattering and turbidity results. Briefly, in the case of SANS, the 
Rayleigh ratio R{q) is determined experimentally as a function of the wave 
number, 



2 f^j sin(0/2), (2.4) 



where Ao is the vacuum wavelength of either photons or neutrons and 9 is the 
scattering angle. See Berne [[|, for example. The Rayleigh ratio is related 
to the structure factor, S(q) and the particle form factor, f(q) through the 
following formula 

R(q) = Kmcf(q)S(q) (2.5) 

where K is a material coefficient, m the mass of the particle, c the concentra- 
tion. The form factor, /, depends on the particle radius as well as q and can 
be computed theoretically (See Berne ||). The form factor may also depend 
on the energy of the incident photons (or neutrons). The structure factor is 
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is the Fourier transform of the radial distribution function of the particles, 
g(r), that are scattering: 



oc 



2 sin(gr) 



S(q) = l+4irp (g(r) - l)r 2 — K -^dr. (2.6) 

Jo 1 r 

The function g(r) is the probability density of finding a pair of particles 
separated by a distance r. Therefore expect that g(r) will depend on the 
number density of large particles and on their interactions, including the 
depletion effect. See Tuinier et al. [|4T], | 2| . 



Comment 2.1 

Scattering functions such as formula ( \2.(\ ) are functions of "reciprocal" space, 
which is defined through the Fourier transform, for example 

f(q) = f f(x)e-^dx (2.7) 

defines the "reciprocal" space representation of the function f with domain 
of definition in "direct" space, M. d . The reciprocal space representation ex- 
presses the physics of the scattering phenomenon as a result, for example, of 
solving Maxwell's equations with appropriate boundary conditions. However, 
it is often useful to Fourier transform the data so as to search for symmetry 
elements. In scattering experiments, formula ( $.4 ) is an expression of the 
vector momentum, p = hk, balance, for example, of a photon scattered by 
fluctuations is 

ftki nc = hk sct ± hq (2.8) 

q = k \e inc -e sct \ (2.9) 

where e inc is the direction of the incident beam and e sct the direction of the 
scattered beam. The wavenumber k is defined as k = 27r/A , where A is 
the wavelength of the incident beam. 

When the scattering is kinematic, which obtains for single scattering 
events, Maxwell's equations teach that the field at a detector distant from 
the illuminated volume, Vj, which includes the particles (1, • ■ • N), is 

(l,-AT)CVj 

¥>(?)= E fn(q,E)e-^l (2.10) 

n=l 
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2 INTERACTING PARTICLES — DEPLETION EFFECT 12 

Here, x n (t) is the trajectory of the nth particle in the volume Vj. If the system 
is monodispersed, all large spheres are of the same radius, f n (q, E) = f(q, E) 
independent of n. In that case 

(1,-N)cVj 

y{q) = f{q,E) £ e~^^ (2.11) 

n=\ 

and the sum, the intermediate scattering function, is the Fourier transform 
of 

(1,-AOcVz 

X (x)= J2 <*(*-**(*))■ (2-12) 

n=l 

Note that {x n (t)} is available from molecular dynamics simulation. 

Dynamic light scattering is observable as a correlation function of the 
photocurrent, which amounts to an average of the square of the field at the 
detector: 

< ip(q,t)tp*(q',t') >=N< f{q,E)f{q,E) > 

(1,-JV)CV/ 

+ <f(q,E)f*(q,E)> ^ < e -#(*»(t)-*«i(f)) > . (2.13) 

n,n=l 

Experimentally, the self-beat method of dynamic light scattering provides a 
correlation function in time and wavenumber of the particles and is accurately 
portrayed in very dilute solutions by < \x n (t) — x n (0)\ >= 6D t where 
the Stokes-Einstein diffusion coefficient is D = (kBT/6iir]) x (|a). Here, 
rj is the viscosity of the medium, a is the hydrodynamic diameter of the 
particle. In addition, the system is so dilute that each particle moves as 
an independent Brownian particle. Indeed, experimental evidence supports 
the assumption that the dynamics are represented by the set of independent 
Langevin equations: 

dv n = -pv n dt + a n {dt) (2.14) 

here f3 = 6irr] x (|o)/m and m is the mass of the large particle. The 
random accelerations are connected to /3 through a correlation function and 



July 31, 2008 - 13h:54m 



2 INTERACTING PARTICLES — DEPLETION EFFECT 13 

a fluctuation dissipation theorem.^ The correlation function is 

< a n (t) ■ aS') >= bS n J(t - t') (2.15) 

and a fluctuation-dissipation theorem for this case gives b = 2(3^- = 2(3 2 D. 
Interparticle forces change the picture. Now the Langevin equation reads 

dv n = F n dt — [3v n dt + a n (dt) (2-16) 

where 

All 
Fn(Xn(t)) = J2^n'{x n {t),X n! {t)) + --- . (2.17) 

n'=l 

It is often assumed that the pair forces dominate to the exclusion of triplet 
and higher contributions considered ignorable. A further simplification is to 
assume that F n > depends only on the separation \x n {t) — x n >(t)\ . When the 
particles satisfy the PHS model, it is clear that F n will affect the correla- 
tion function of dynamic light scattering as will also the necessarily different 
relationship between f3 and b. 

The focus of this paper is the form of a stochastic term analogous to 
that in formula ( |2.14| ), which in the hard sphere case is shown to involve 



a depletion effect. The more general case wherein intermolecular forces are 
other than hard sphere must still reflect the depletion effect derived herein. 



2 See van Kampen |f45| 
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2.2.2 Measurement of forces between two large particles and be- 
tween a large particle and a plate. 

It is now possible to measure the forces between a micron to submicron size 
particle directly against a fiat plate or a second particle. The basic idea of 
the measurement is shown in Figure (0). 




Figure 3: Conceptualization of the force measurement as the two particles 
are forced together. 



The force measurement as the two particles come together is conceptual- 
ized by an atomic force microscopy (AFM) measurement where the cantilever 
is replaced by a spring attached to the upper particle. As the particles are 
brought to small separation, h = r — a&, a change in Ax is observed and 
transformed into a force since the spring constant is known. Here a& is the 
diameter of the large particles. The force experienced by the large particle is 
proportional to Ax scaled by the spring constant in the figure. The AFM ex- 
periment is usually done with a flat plate instead of the lower particle shown 
here. The function Ax will fluctuate in time as a result of impacts of the 
small particles on the large particles. 



Atomic force microscopy (AFM) AFM experiments have produced in- 



teresting depletion data, see for example Knoben et al. | |], Clark, et al. 0, 
and Tulpar, et al. [ |]. However, AFM experiments suffer from two related 
problems. When the attractive force is too large, the upper particle will 
snap to the lower particle or plate, therefore, the full force curve is not gen- 
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erally available. Providing the optimum spring constant is difficult. It is also 
difficult to define accurately the point of contact between the sphere and 
substrate and therefore the actual separation between the particle mounted 
on the cantilever and the substrate is poorly defined. Progress to resolve this 
difficulty is discussed in references ] J and |3l| . Biggs, et al. J J point out 
that while TIRM (total internal reflection microscopy) is more sensitive, the 
technique cannot measure strong forces; the two techniques AFM and TIRM 
are complementary. 



Total internal reflection microscopy (TIRM) has the advantage of us- 
ing an evanescent wave to interrogate the position of a large particle with 
respect to an optical surface, see Tuinier et al. [j|l[] for a brief description of 
the technique and see references , f36f , |34j and || for reports of depletion 
effects observed. Also, see Figure (f|). 




Figure 4: The laser beam (in red) is incident on the flat interface at an angle 
consistent with total internal reflection. The evanescent wave, heavy (red) 
line, attenuates exponentially into the solution phase. The intensity, 1(h), of 
the light scattered by the particle, (red) arrows is detected. 



In Figure (§), the intensity, 1(h), of the light scattered by the particle (red 
arrows) is detected and follows the rule 1(h) = 1(0) exp(-Ah) where 1/A is 
the 1/e penetration depth of the electric field. The coefficient A depends on 
the refractive index numbers n\ and ri2- The scattering intensity depends on 
ri2 and n^. In addition to the depletion forces (and colloid forces in general), 
the particles sediment, g is the acceleration of gravity. 
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Consider Figure (||), taken from Bechinger et al. PB[ , which shows the 
variation of fluctuation amplitudes with concentration of the small particles. 



o.oo -, 



0.05 



5* 0.10 



in 

c 

0) 



0.15 



0.20 



0.25 




h>0 



h = 



200 400 600 800 1000 

Time (sec) 



Figure 5: TIRM fuluctuations in intensity observed when small spheres are 
absent. 



The authors used a dilute solution of polystyrene spheres of 1.5/xm radius 
dispersed in water. Polyethylene oxide with molecular weight of 2 x 10 -6 
Dalton was added, which had a radius of gyration of 0.01/im and behaved 
as a small hard spheres. No polymer was added for the time-series shown 
in Figure ([5]), so that the potential constructed from p(h), the probability 
density that h will occur, generated from this time series, gave the function 
shown in Figure (^j for ifs = 0.0. The raw data for cp$ = 0.032 were probably 
similar to that of Figure (||) but was not reported in Bechinger et al. |36| . 
Note that at h — 0, the intensity of the scattered light will reach a maximum. 
The bar just above the 0.25 tick indicates the intensity for h = 0. Figure (O) 



shows the change in the amplitude of particle fluctuations along the normal to 
the glass substrate. They convert the data of Figure (|^) to potentials through 
the determination of p(h), then, the potential, V(h), essentially a free energy, 
is computed through the canonical ensemble (Helmholtz) distribution density 
taken as 



p(h) 



ce 



-V{h)/k B T 



(2.18) 



Unfortunately, these and other authors did not determine the space-time 
correlation functions of these systems. 
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Comment 2.2 

Since the Hamiltonian of the B-particle observed in the evanescent optical 
field is of the form H = ^-p ■ p + V(h), the joint probability density factors 
so that formula ( \2.1§ ) obtains where c is the inverse partition function, - 

roc 

Jo 



-V{h)/k B T 



dh. Note that the particle is constrained to h > 0,see Figure 



(fjj. Formula ( \2. 1 1\ ) is considered exact, see Henderson et al. [2(\] . 



A second paper, by Bechinger et al. [3c] showed an interesting pattern; 
the AO - Vrij model works reasonably at low concentrations of polymer but 
a maximum is found at higher concentrations. See Figure^. 
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Figure 6: A. The total potential is shown and is considered to be a sum 
of double layer, sedimentation, Van der Waals and depletion effects. The 
variation with separation between the large particle and the plate is h. The 
polymer concentrations (number density) are n = 0, or a volume fraction of 
(ps — 0.0 and n = Q.3fim~ 3 or (p$ = 0.032. B. Shows the difference between 
the data of the curves <ps = 0.032 and 0.0, which represents the depletion 
component of the potential. (Redrawn from Figure 1 of Bechinger ||36|| .) 



That a potential barrier develops at higher concentrations of polymer is 
further supported by experiments using laser tweezer technology. Also see 
Zhou H§. 

Oetama and Walz JJ3J] report the result of a study of short-time dynamics 
with a focus on determining the diffusion coefficient of the large particles in 
the presence of the small particles. 
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Laser tweezer results Particles can be trapped in the focal volume of a 
laser if the refractive index difference between the particle and liquid is of 
sufficient magnitude. Once trapped, the force that the particle experiences 
with respect to external sources can be measured down to nanometer length 
scales and sub-picoNewton forces, for example, between two 5-particles. See 
Figure (0). 




Figure 7: The focus of a laser beam is rastered at a sufficient rate that the 
particle is trapped to move in a tube of length L. L is sufficiently large 
that the depletion force is ignorably small for that maximum separation. 



The Brownian motion of the particles is tracked in time. Reference [. | and 
papers cited therein provide details of the measurement technique. 



Crocker, et al. [ J used a tweezer system that scans a roughly 10/im 
trapping line in a thin sample cell. In a typical run, a pair of large spheres 
(a^ = 1.100 ±0.015/wn diameter) of PMMA (polymethylmethacrylate) was 
trapped in the tweezer and one-dimensional Brownian motion was monitored. 
Various volume fractions of a s = 0.083yum diameter polystyrene spheres (ps) 
provided the background. Here too, the pair potential is computed from the 
observed distribution of separations of a pair of particles through the formula 

pflxi - x 2 \) = ce-n\Si-S2\)/k B T _ ( 2 19 ) 

Comment 2.3 

In this case, the Hamiltonian is H = ^r-pi ■ P\ + ^f>2 • Vi + V(x\, x?)- As- 
sume that V{x\,X2) = V(\x\ — X2I), then the maximum entropy principal 
provides formula (\2.1Q) . This formula is considered exact given the form of 
V, ft2C\J. Triplet depletion forces may be important enough to be considered, 
see Melchionna and Hansen Ml 
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At low volume fractions of ps, they confirmed the model of AO-Vrij as 
shown in Figure M A.). 
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Figure 8: 

A. The laser tweezer method was applied to very dilute solutions of ~lfim 
diameter large hard-spheres. Small volume fractions, f s , of ~0.08/im diame- 
ter small hard-spheres were added. The pair interaction energy was fit well 
by the AO-Vrij theory. 

B. The particle sizes are the same as noted for Figure A. A range of volume 
fractions was reported by the authors as shown. Note the structure that is 
evident at ip s > 0.07. Note that r = \x± — X2I and that h = r — a?,. These 
Figures were redrawn from Figures 1 and 3 of reference || . 



The fit to the AO-Vrij theory is convincing that the physics was captured 
quite well by a very simple PHS theory. However, Crocker et al. | J report 
a significant deviation from the AO-Vrij theory at larger volume fractions, 
ip s > 0.07, as shown in Figure (|8| B.). Also see Zhou [Q for a theoretical 
viewpoint. The authors show that there is a substantial depletion repulsion 
at separations that correspond to approximately one small sphere diameter 
when tp s > 1. 

The time-space correlation functions for the laser tweezer experiment as 
well as the TIRM experiment are in principle observable. 



The molecular dynamic equivalent of the laser tweezer; steered 

MD. For example, see Park and Schulten Q | on calculating potentials 
of mean force from steered molecular dynamics simulation. The molecular 
dynamic equivalent of the laser tweezer experiment is formulated as follows. 
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The equations of motion of the small particles are for each particle, s = 

1, z, . . . , fTsi 

dv s => => 
rrie—r- = F s \ s + F, 



dt 



s\s "I" r s \b 



M, . (2 ' 20) 

nr =v ' 

where m s is the mass of the small particle, v s is its velocity, F s \ s = Yla=i ^ sa 
is the sum of forces between the small particle s and the other small particles 
a, F s \b = J2 n=1 F sn is the sum of forces between the small particle, s, and all 
of the -B-particles. The small particles are not otherwise constrained (other 
than for the usual cyclic boundary conditions). 

Analogous to the laser tweezer experiment, a constraint to the motion 
of the large particles can be imposed through adding to Equation (|2.20|) 
and external force field E. The equations of motion for the large particles 
(B— particles) are 

m B-j- = F b \ b + F h \ s + E, 

?£ (2.21) 

dxb 

-dT = Vb ' 

where tub is the mass of the large particle and Vj, its velocity. The forces 
are computed between the two large particles, F b \b and -B-particles interacing 
with small, F b \ s = Ylt=i ^ ba - ^- n addition, the large particles are constrained 
to move on a line (without friction), but allowed by a harmonic force function 
to experience small fluctuations away from the line: 

E(x, y, z) = -K B (xe x + ye y ). (2.22) 

where Kb is the spring constant and e x , e y are unit vectors perpendicular to 
the z axis taken along the line of constraint. 

The various forces F sa ,F ba , and F b \ b may be hard sphere, PHS or follow 
some simple but convenient force law such as the gradient of L-J 12-6 poten- 
tials. The specifications of these force laws allow the accurate separation of 
the depletion effect from the total force of interaction. 

A protocol for the simulation involves the initial positioning of the parti- 
cles and assigning initial velocities from a Maxwell distribution to the small 
particles. Initially, the I?-particles are fixed along the z-axis separated by 
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about 10 small particle diameters and the small particles will redistribute 
following the dynamical Equations (|2.20|) until equilibrium is reached as sug- 
gested by Figure ([I], A). Then the 5-particles are released and given an 
initial speed drawn from a Maxwell distribution and allowed to move toward 
collision following the dynamics of Equation ( p.21| ). 

It is clear that the characteristic time constant can be quite different 
for the two classes of particles and that makes the simulation of a binary 
solution with many 5-particles awkward if not impractical. Indeed if the 
set of -B-particles are free to move but their number density is small, expect 
collisions to be so rare that the depletion effect will be rare and therefore the 
statistics of the depletion potential will be very poorly represented. However, 
the harmonic force, Equation ( |2.22| ), steers two particles sufficiently that 
they will collide relatively often and thereby enhance the determination of 
the distribution function from which the pair potential for the B-particles is 
estimated. Also, since the acceleration, velocity and position of all particles 
are available it is possible to compute averages such as < a n {t) ■ a^it') > and 
compare these "experimental" numbers with theory. 

3 Underlying Model and Stochastic Limit 

In constructing our model we speak of particles instead of atoms or molecules. 
The term solute refers to the large particles and the term solvent to the 
medium of small particles. In 1905 Einstein [|1(| developed a model of Brow- 
nian motion to describe the motion of the large particles as a result of their 
interaction with the small particles. He assumed that the motions of the large 
particles are statistically independent provided the system is very dilute; that 
is, the large particles are far apart from one another. (See Figure ([1], A).) In 
rigorous mathematical treatments of several Brownian motions (Wiener pro- 
cesses), the assumption of independence, regardless of separation distance, 
has become widely accepted. However, we saw in Section [2| that when the 
large particles are close together the depletion effect induces a force that at- 
tracts the particles to one another. In particular, the motions of the large 
particles become statistically correlated when they are close, as measured by 
the correlation length y/e. 

We therefore seek a model for correlated Brownian motions of the large 
particles that satisfies the following four 
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Desiderata: 

1. The marginal motion of any single particle is Brownian (Wiener pro- 
cess) .0 

2. If the particles are widely separated (dilute system), they perform ap- 
proximately independent Brownian motions. 

3. If the separation between particles is small, their motions are correlated. 
Moreover, the correlation is such that if the separation is sufficiently 
small, as measured by the the correlation length, they tend statistically 
to approach one another further. 

4. As the correlation length tends to zero, the particles become 5— corre- 
lated in space and time.f] 



Kotelenez | | obtained a class of correlated Brownian motions as a scaling 
limit for the positions of several large particles immersed in a medium of 
infinitely many smaller particles.0 Here is a brief sketch of this work. 

We need to consider two levels of description of the particle system. On 
the microscopic level, we suppose Newtonian mechanics governs the equa- 
tions of motion of the individual atoms or molecules. These equations are 
cast in the form of a system of deterministic coupled nonlinear equations. 
The next level is called mesoscopic. On this level the motion of the large 
particles is stochastic; the randomness of their motions is determined by the 
surrounding medium. Here, spatially extended particles are replaced by point 
particles; large and small particles are distinguished by their large and small 
masses. f] Furthermore, the interaction between small particles is assumed to 



3 Since we are dealing here with the joint motion of several large particles and each 
motion is, in a sense, an infinite dimensional random variable, the term marginal must 
properly be defined in this context. We do this for pair- motions (two large particles) in 
Appendix p^3. 

4 That is, in the limit the particles become uncorrelated unless they collide. 

5 Kotelenez Ea, M(1995) introduced correlated Brownian motions as a driving term 
in stochastic ordinary differential equations (SODEs) and stochastic partial differential 
equations (SPDEs). (Sec, for example, Equation ( |3.5| .)) 

6 For a rare gas a mean-field force can be a result of coarse graining in space an d tim e, 
where on a finer scale the interaction is governed by collisions. See also Comment p.l[ 
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be negligible and interactions between large particles can (temporarily) be 
neglected .[] 

We suppose that the interaction between large and small particles is gov- 
erned by a scalar- valued potential of the form <p(\r — q\ 2 ), where r denotes 
the position of a large particle and q denotes the position of a small particle. 
Thus we assume that the potential does not depend on the locations of the 
two particles but only on their vector difference, r — q; it is a homogeneous 
or shift invariant function of r and q. In fact, we assume that the potential 
depends only on the magnitude of the difference or the separation, \r — q\; it 
is an isotropic function of the difference.^ This case will be studied in detail 
in Section |[ Recall that Brownian motion is interpreted to be the result 
of collisions between many, fast moving, small particles and a few, slowly 
moving, large particles. Often it is assumed that these collisions are elastic. 

Specifically, we suppose that the force g e ^{r — q) on a large particle at r 
due to a small particle at q is derivable from the potential e ,fi'- 

9eA r ~ «) := -V<M|r - q\ 2 ) = -2(r - q)<p' £ J\r - q\ 2 ). (3.1) 

The potential function (p e ^ and, hence, the force g £ depend on two param- 
eters: the correlation length yfe and a time-scale parameter /i.[] 

In the classical model of a dilute system of large (Brownian) particles, 
mentioned above, the fluctuation forces /(r 1 ,^) and /(r 2 ,^) on the po- 
sitions of two large particles, located at r 1 and r 2 at times t\ and £2, 
are assumed to be 5— correlated in space and time: (/(t* 1 ,^)/ (r 2 ,t 2 )) oc 
(5(r x — r 2 ) (g) 8(\t\ — t^l), where 8 and S denote Dirac's 5— function in M. d and 
K..Q In contrast, our model can capture the qualitative behavior of both 
dilute and non-dilute systems. Since we consider 5— correlated noise to be 
an approximation to the more realistic spatially correlated noise, we require 



7 As the interaction between large particles occurs on a much slower time scale than 
the interaction between large and small particles, it can be included after the scaling limit 
employing fractional steps (See Goncharuk and Kotelenez pi). 

8 This assumption is entir ely consistent with the Principle of Material Frame Indiffer- 



ence, discus sed in Su bsection |4.2| . In fact, for a homogeneous scalar function of two vector 
arguments, isotropy is equivalent to isotropy in the difference. Here \r — q\ denotes the 
Euclidean distance in the state space W* between the two particles. 

9 The parameter /j, has the units of reciprocal time ([=]^), V denotes the spatial gra- 
dient in K rf , and the prime (') denotes differentiation with respect to the scalar argument. 

10 In terms of stochastic analysis this means that the "noise" (replacing the solute of 
small particles) is white in space and time. In particular, this implies the independent 
increments in Brownian motions. 
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in Desideratum [| that the fluctuation forces associated with our model be 
5— correlated in the limit as the correlation length yfe \ 0. 

Suppose there are N large particles and infinitely many small particles 
distributed in the Euclidean state space R^.Q At time t, the position of the 
ath large particle is denoted by r a {t) and its velocity is denoted by v a (t). 
The position and velocity of the Ath small particle are denoted by q x (t) and 
w x (t). The mass of a large particle is mi, the mass of a small particle is m s , 
and we suppose that m s <C mi. The empirical mass distributions of large 
and small particles are (formally) given by 

N 

X(dr,t) = mi2_ J x ~r a (t){dr) and y (dq, t) — m s 7] $g x (t) (dq) ■ (3.2) 

q=i AeN 

We assume, at the microscopic level, that the interaction between small and 
large particles can be described by the following infinite system of coupled 
nonlinear dynamic equations^ 

±v a (t) = -„v a (t) + — / g £ Jr a (t) - q)y(dq,t), 

dt m t m s J Rd '^ , . 



d_ w., 

dt 



w\t) = — f g (r - q\t))X(dr,t), 

mim s J Rd ,/x 



where a = 1, 2, • ■ ■ ,N and A G N. The positive time-scale parameter, /i, is 
also introduced here as a Stokes-friction parameter associated with the large 
particles. To specify an evolutionary system we append the random initial 
conditions: 

r°(0) = r£, v a (0) = v%, q x (0) = q x , w x (0) = w x . (3.4) 



11 Infinitely many small particles are needed to generate independent increments in the 
limiting Brownian motion. See our Comment p73 and Kotclcnez [B7J. 

12 Note that Equation (3.1) implies g e (r — q) = — g £ (q — r); that is, the force on r 



due to q is necessarily equal and opposite to the force on q due to r. Strictly spe aking 



g e „ plays the role of a force density or force per small particle in Equations (3^2) and 
therefore has units of force per unit volume {\=\?^^j^). 
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The resulting mesoscopic model of correlated Brownian motions can be 
defined as follows: Let w(dq,ds) denote standard Gaussian white noise on 
M. d x M + , which is a space-time generalization of the time increments of a 



standard scalar-valued Brownian motion. (See Appendix [E 1] for details of 
this generalization.) The white noise and the random initial data are defined 
on the same probability space and are assumed independent. 

In the stochastic limit, the positions of the large particles are shown to 
be the solutions of the N kinematic stochastic integral equations 

r a (t) = r a + [ [ g £ (r a (s)-q)w(dq,ds), a = l,...,N. (3.5) 

JO JR d 



The kernel, g £ , in Equations (|3.5| ) is induced by the force density field g £ „ of 
Equations ( |3.3| ) through the transition from a second order system in time 
(dynamic description) to a first order system in time (kinematic description). 
This transition requires an assumption that the force density field g £ has 
specific asymptotic behavior as the time-scale/friction parameter jjl gets large; 
specifically, there is a function r i— > g e (r) such that[^] 



9eA r ) ~ Me( r ) as » -► oo. (3.6) 

In the sequel, we refer to the function g £ as the [forcing kerncl| . p^| 



Comment 3.1 

Suppose that small particles move with different velocities. If most of the 
small particles moving in the direction of a large particle can avoid collisions 
with other small particles (as in a rare gas or in the PHS model of Section \^), 
fast small particles coming from 'far away" can collide with a given large 
particle at approximately the same time as slow small particles that were close 
to the large particle before the collision. If, in repeated microscopic time steps, 
collisions of a given small particle with the same large particle are negligible, 
then, in a mesoscopic time unit, the collision dynamics can be replaced by 
long-range mean field dynamics. Dealing with a wide range of velocities and 
working with discrete time steps, a long range force is generated. 

If, for instance, we assume that the empirical velocity distribution of the 
small particles is approximately Maxwellian, the aforementioned force den- 



13 0bserve that large values of /i correspond to short time-scales. 

14 If we suppose that the space-time white noise, w(dq,dt), has the units of volume times 



time ([=]L d T), the forcing kernel, g e , cannot have the units of a force density. 
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sity, g e », can be given by an expression in the form of Equation (Wb): (See 
Kotelenez $%j, Equation (1.2)^ 

\r-q\ 2 

9eA r - 1) ~ ^eA r - q) e 2e , ( 3 - 7 ) 

where \x is the friction /time- scale parameter associated with the large parti- 
cles and K Sy d is a normalizing constant chosen so that the particles become 
5— correlated in a standard way as e \ 0. 

In other words, if the above assumptions hold in a first approximation, the 
interactions between large and small particles are governed by a velocity field 
for which the variance of the distribution is the correlation length. Obviously, 
this example can be generalized to an arbitrary velocity field of the small 
particles. (A more realistic model might involve some, possibly nonlinear, 
transformation of the velocity field, taking into account collisions between 
small particles, etc.) For the purpose of our work here, it suffices to consider 



a general forcing kernel , as in Equations ( \3.5\) , and show that for certain 
kernels the right hand side behaves according to the requirements stated at 
the beginning of this section. 



MaxwelJ| 3.1 



We must choose the constant K £> d of Equation ( \3.Tj) in order to satisfy 



Desideratum [|. Later we will see that for the ^ 'axwell kerne\ , k £ ^ should be 



r| 



chosen so that if g e (r) := K e ^ve~~ in the kinetic stochastic Equation (3.5), 
then r i— > i|f7 e (r)| 2 must approximate the 5— function as e [ 0. This require- 
ment implies k 2 . A(ire) d / 2 = 1. This normalization implies thatr \— > ^|<7 £ (r)| 2 
is a probability density on M. d . We record for later use, that the variance of 
the associated probability distribution is 2 — . 

Of course, Desideratum [J requires that for any forcing function, r \— > 
\\g e {r)\ 2 must approximate the 5— function as e { 0. Later we will see that 
this is equivalent to requiring that, in the limit as e { ; the infinitessimal 
generator associated with the diffusion is one-half the Laplacian. (See Com- 



ment^7J\.) This, in turn, is equivalent to requiring that, in the limit as e [ 0, 
each particle experiences a standard Brownian motion and that the motions 
are independent. 



15 Clearly, the right-hand side of Equation (3.7) is of the form of Equation (3.1); that is, 
it is the negative gradient of a potential. 
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Comment 3.2 

The transition from the microscopic (dynamic) to the mesoscopic (kinematic) 

description is quite technical. See Kotelenez WN for the details. Heuristi- 



cally, however, an examination of the resulting kinematic Equations t \3.5]) 
reveals that one consequence of the procedure is to render negligible the in- 
ertial effect on each large particle due to its interactions with all the small 
particles. In the sequel our model will imply that the effect on each large 
particle by all the small particles will be, at most, fluctuations in the posi- 
tion of the large particle. An analysis of the mutual spatial correlations of 
these motions is the object of our work here. To describe these correlations 
and compare them with the depletion phenomenon we have here excluded the 
interactions between the large Brownian particles^ 



It is important to note that the system of Equations ( | . | ) is coupled 
only through the Gaussian space-time white noise w(dq, gLs).Q Regularity 
assumptions are assumed sufficient to guarantee that the integrals (Ito inte- 
grals) define continuous square integrable martingales and that, for the large 
class of kernels g £ we consider, each of the Equations ( |3.5| ) has a unique 
solution, which is itself a Brownian motion. In Section [5| we give a detailed 
analysis of the correlations between these motions when N = 2; that is, when 
there are two large particles. It will be shown that the joint motion of the 
pair is not Brownian. 

The transition from the microscopic description of Equations ( |3.3| , |3.4|) 
to the mesoscopic (stochastic) description of Equation fl3.5|) is accomplished 
through the following steps: 

• Form small clusters (ensembles) of particles, if their initial positions 
and velocities are similar (coarse graining in space). 



Replace the time derivative in Equations ( |3.3|) by an Euler scheme 
(coarse graining in time). 

Randomize the initial distribution of clusters, where the probability 
distribution is determined by the relative sizes of the clusters, assuming 



16 There is a large literature on interacting Brownian motions. In the context of coagu- 
lating Brownian particles refer to a recent paper by Hammond and Rczakhanlou |lS[| and 
the references therein. 

17 The space-time white noise w(dq, ds) is obtained in a scaling limit from the number of 
small particles, in the small volume dq and the small time interval ds, with a given large 
particle. See Kotelenez | ?j and the brief discussion here of the scaling limit. 
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statistical independence of the initial distributions of different clusters. 

Assume that the initial average velocity, (w ), of the small particles and 
the time-scale/friction coefficient fj, associated with the large particles 
both tend to infinity in such a way that y/efi <C (wo). 

Allow the small particles to escape to infinity after interacting with the 
large particles for a macroscopically small time.P] 



Carrying out these steps, Kotelenez | J shows that the positions of the 
large particles in a sequence of coarse grained versions of Equations ( |3.3| , |3.4|) 
tend to the solutions of Equations ( |3.5|) weakly in an appropriate space of 
functions with values in ~R dN , d > 2.Q We begin the substance of our work 
here with the motions governed by the system of Equations ( |3.5|) . 

Comment 3.3 

The escape to infinity after a short period of interaction with the large par- 
ticles is necessary to generate independent increments in the limit. This can 
be seen as follows: The small time induces a partition of the time axis into 
small time intervals. In each of the small time intervals the large particles 
are being displaced by the interaction with clusters of small particles. Note 
that the vast majority of small particles had previously not interacted with the 
large particles and that they disappear toward infinity after that time step. 
Since clusters have started independently, this implies almost independence 
of the displacements of the large particles in different time intervals. In a 
scaling limit, as the initial velocities of the small particles and the friction 
coefficient for the large particles tend to infinity, the motions of the large 
particles have independent increments. The requirements of an infinite num- 
ber of small particles and their eventual escape to infinity are both needed 
to obtain independent increments in time. A similar result is obtained if no 
friction term is introduced in the dynamic equations for the large particles. 
In this case, however, the limit is an Ornstein- Uhlenbeck model (described by 
Langevin equations), where the velocities, rather than the positions, perform 
correlated Brownian motions. 



18 This hypothesis seems to be acceptable if, for spatially extended particles, the in- 
terparticle distance is considerably greater than the diameter of a typical particle. The 
assumption holds for a gas (See Lifshits and Pitayevskii |£J, Ch.l, §3 ), but not for a 
liquid, like water. For a liquid, we refer to the PHS model, introduced in Section g. 

19 More precisely, in the Skorohod space of CADLAG functions with values in M. dN , d > 2. 
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Comment 3.4 

Klyatskin and Gurarie j^/ s/icw that a kinetic model of the form ^r{t) = 
f(r(t),t), where f is a field consisting of a deterministic term plus a random 
term, can exhibit clustering in simulations when the vector field f is com- 
pressive; that is, derivable from a potential. The vector field in our kinetic 
model is compressive. 

4 Preliminaries 

4.1 Notation 

In this section we set out the basic notational conventions used throughout 
the rest of our paper. 

The real numbers are denoted by M and the non-negative real numbers 
by M + . Scalars are always lightface symbols. In particular, t and s always 
denote times. Vectors and square matrices are denoted by boldface symbols. 
Generally, we use Latin minuscules for vectors and Latin majuscules for 
square matrices in the d- dimensional real Euclidean vector space M. d , d > 1. 
Unless otherwise stated, all matrices are vectors or square matrices; that is, 
(d x l)-matrices, (1 x (i)-matrices, or (d x rf)-matrices. If M denotes a matrix 
and x denotes a vector, M and x T denote their transposes. We say vector 
when we mean column vector; that is, x e JBL dxl and, hence, x T G M. lxd 
is a row vector. For a matrix, we say transformation when we mean to 
emphasize its role as a linear transformation. For two vectors, x, y, their 
inner (scalar) product is denoted by x • y and their outer (tensor) product 
is denoted by x <g> y. We have x T y = x • y, a scalar, and xy T = x <g> y, 
a square matrix. The norm, or length, of a vector x is \x\ := \fx • x. (We 
also write | — ^ | for the modulus of a scalar.) The inner (scalar) product of two 
matrices, M and N, is the scalar given by M • N := trace(MN ).F] The 
norm of a matrix induced by this inner product is the Hilbert- Schmidt norm 

\M\ := y/MmM. Q 

For y ^ 0, we write u(y) for the unit outward radial vector (at y): 

u(y) := ± (4.1) 

\y\ 



20 Note that trace(xy T ) — x T y or, cquivalcntly, trace(x (8) y) = x • y. 
21 It is topologically equivalent to the operator norm. ||Af ||, of the matrix as a linear 
operator on R d . 
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Any identity matrix is denoted by 1 and any matrix or vector whose en- 
tries are all zeros is denoted by 0. For y ^ we write P(y) and P (y) 
for the complementary orthogonal projections onto the y direction, the 
1— dimensional subspace {y}, and the hyperplane orthogonal to y, the 
(d — 1)— dimensional subspace {y} ± , given by0 

T T 

P(y):=yy- and P ± (y) := 1 - P(y) = 1 - ^-. (4.2) 

\y\ \y\ 

Note that, for y ^ 0, 
P(y).P(y) = 1, P ± (y)*P ± (y) = d-1, and P(y)*P ± (y) = 0. (4.3) 

The position vector of a large particle is denoted by r. In case there 
are more than one, the position vector of the ath large particle is denoted 
by r a . In general, large particles are indexed by Greek minuscules, so a, j3 = 
1,2, ... ,N. In M d , the standard coordinates of a vector r are indexed by 
Latin minuscules; thus, rj denotes the jth standard coordinate of r a .f^ 

The underlying probability space for all random variables is (f2, T ' , V). If 
Z denotes a random variable, Z = Z(cu), uj G Q, we suppress the dependence 
on uj, unless confusion is likely or emphasis is needed. We write E\Z\ to 
denote its expectation: E\Z~\ := J Q Z(u)V(dw). 

The norm of a function, /, (scalar- or vector-valued) in an appropriate 
L 2 setting is denoted by ||/||. If ^ i — s- f(() is a function of a scalar argument 
(scalar-, vector-, or tensor- valued) differentiable at ( = (, we write /'(C) f° r 

We adopt the following special convention: If (a? 1 , a? 2 ) is an (ordered) 
pair of vectors in IR d = M dxl then x is the vector in M? d = M? dxl given by 



x 



x L 



x 2 



4.2 Material Frame-IndifFerence 

The material descriptions that underlie our model are assumed to be invariant 
under changes in external observer. This fundamental requirement, known 



22 If necessary, we extend these definitions to include y — by requiring P(0) := 0, the 
projection on {0} and, hence, P (0) := 1, the projection on {0}- 1 . 

23 Coordinates are scalars, so the corresponding symbol is lightface. Standard means 
with respect to some prescribed orthonormal basis. 
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since the time of Stokes, is now called the Principle of Material Frame- 



Indifference. As formulated by Noll \\ 3], it asserts 



The constitutive laws governing the internal interactions between 
the parts of a system should not depend on whatever external 
frame of reference is used to describe them. 

In our context, this requirement imposes invariance relations on certain ma- 
terially significant scalar-, vector-, and matrix-valued functions of position. 
In the context of constitutive laws, functions restricted by this principle are 
often said to be frame-indifferent; mathematically, they are isotropic^ 

Definition 4.1 (Isotropic Functions) 

Let r — > ip(r), fir), and F{r) denote scalar-, vector-, and matrix-valued 
functions of position on M. d . They are isotropic functions whenever they 
satisfy: 

*l>{Qr) = 1>{r), (4.4) 

/(Or) = Qf(r), (4.5) 

F(Qr) = QF(r)Q T , (4.6) 

for all orthogonal transformations ((d x d)- orthogonal matrices) Q on IR d .|^] 
Isotropic functions must have very special forms. 



Lemma 4.1 (Representations for [Isotropic! Functions) 

1. A scalar-valued function of position, ip : M. d — > R, is \isotropic\ if and 



only if there is a scalar-valued function £ i— >• a(£) : R + — > M such that 

1>{r) = a(|rf); (4.7) 



24 There is an extensive literature on such functions in the context of mechanics. See, for 
example, the comprehensive article in the Encyclopedia of Physics - The Non-Linear Field 
Theories of Mechanics by Truesdell and Noll J4(J. In particular, they state and outline a 
proof of the fundamental Representation Theorem of Cauchy H on which Lemma 4.1 is 
based. (For details, see Truesdell and Noll @, §B,II,11.) 

25 The collection of (d x d) -orthogonal matrices, Q, is the orthogonal group, O d . The 
proper orthogonal group, Oi, is the sub-group that preserves orientation; these are the 
rotations. In particular, if Q € O d then detQ = ±1; while if Q <S O c \ then detQ = 1. 
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2. A vector-valued function of position, f : 
only if there is a scalar-valued function £ 



i — > M. d , is isotropic] if and 
/3(£) : M+ ->• R such tha^ 



f(r) = f3(\r\ 2 )r; and 



(4.8) 



3. A matrix-valued function of position, F : M. d - 
only if there are two scalar-valued functions £ i— * 
such tha^\ 

F(r) = 7(|r| 2 )l + r/(|r| 2 )rr :i 



ndxd 



IS 



\isotropi(\ if and 



7(0,7/(0: 



(4.9) 

A proof of this Lemma is provided in Appendix [S].0 We will also use the 
following alternate version Lemma ( |4.1|) , Part (||), employing the spectral 
decomposition and the projections of Equation (JO): 



3. (Alternate) A matrix-valued function of position, F : R d — ► 
M dxa! , is |zso£rojP2'c| z/ and only if there are two scalar-valued 
functions £ i-f A(f), A_i_(£) : M + ->■ R s^c/i ^o<0 



F(r) = A(|rf )P(r) + A ± (|r| 2 )P ± (r). 



(4.10) 



In this version A(|r| 2 ) and Aj_(|r| 2 ) are the eigenvalues of F(r) with cor- 
responding eigenspaces {r} and {^j^-Q If these eigenvalues are distinct, 
then A(|r| 2 ) is a simple eigenvalue; if not, F(r) = A(|r| 2 )l = Aj_(|r| 2 )l. 
The latter occurs if and only if in addition to being isotropic, F satisfies: 
F(r) = F(Qr), for all orthogonal Q.0 



26 [sotropy here requires that /(0) = 0. 

27 [sotropy here requires that -F(O) = rel, for some constant k. This is a special case of 
the well-known result that a matrix commutes with all orthogonal matrices if, and only if, 
it is a multiple of the identity. See Theorem A. 4 and the discussion in Appendi x \P\ This 
is implicit in Equations (4.9) and (4.1C). Note also that the conclusion of Part |3| implies 
that a isotropic matrix function is symmetric. 

28 If the underlying vector space has dimension d = 1, the invariance of statements in 
Equations (4.4 — 14.6| ) are not all the same; the statements of Equations (4.4) and (4.G) 
are the same and imply that the function in question is even while that of Equation (4.5) 
implies that the function in question is odd. 

29 O bserve th at the complementary orthogonal projections P{r) and P (r) are them- 
selves isotropic. 

30 Here, {r} denotes the subspace spanned by r and {r}- 1 denotes its orthogonal com- 
pliment. 

31 In this case F(r) commu tes w ith all orthogonal transformations, so the result follows 
independently from Theorem A. 4 in Appendix |A|. 
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5 One- and Two-Particle Systems 

Henceforth we will track a single particle, with position vector r, or an or- 
dered pair of particles, (r a ,rP). Following our convention, the pair-position 
vector r is the vector in M 2d with the block form 

ll) ■ (5-1) 

In this context, the indices will always take the values a, (3 = 1,2. We begin 
with some basics and the fundamental one-particle system. 

5.1 The Fundamental Kinetic Equation 

Suppose the random position vector of a large particle, r, depends on time, 
t i— > r{t). From the stochastic limit, described in Section [|, we begin with 
the assumption that the random position vector of a single large particle 
obeys a kinematic equation of the form: 

dr{t)= I g(r(t)-q)w(dq,dt). (5.2) 

This is a stochastic integral equation of Ito type. The integrator, w(dq, dt), is 
standard space-time Gaussian white noisej^] which represents the influence 
of the medium of small particles upon the large particles. The nature of the 
forcing on the large particles is determined through the forcing kernel g. 

The |forcing kernei| g : M. d — > M. d ;r i— > g(r), is a vector- valued function 
of position derived from the underlying physics governing the interactions 
between large and small particles. In effect, g can be taken proportional to 
the negative spatial gradient of the velocity distribution of the small parti- 
cles.0 It plays the role of a constitutive function in Equation (|5.2| ) and in 
the sequel. As required by the |Principle of Material Frame-Indifference ,| the 



distributions implicit in the derivation of Equation ( |5.2|) must be indepen- 
dent under changes in external observer. Consequently, we start with the 
following.^ 

32 See Walsh g| 

33 If ip(r) is a differcntiable scalar-valued isotropic function of position r, then its gradient 
V r ^'(r) is a isotropic vector-valued function of position. In particular, V r 7(|r| 2 ) = 
2'y'(\r\ 2 )r. The converse is also true. 

34 For convenience, we suppress the subscript e of g in Section except when a specific 



form of the forcing kernel is used. 
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Hypothesis 5.1 ( [Isotropy] of the |Forcing Kernel]) 



The forcing kernel , g, is isotropic: 

g(Qr) = Qg(r), (5.3) 

for all orthogonal transformations Q in WL d . 



By this hypothesis and Lemma (|4.1| ), there is a scalar function, called the 
forcing function, (p '■ K + - ► R; £ l—> 0(0 such thatQ 

g(r) = (f>(\r\ 2 )r. (5.4) 



Hypothesis 5.2 (Regularity of the forcing" Kernel|) 



The scalar-valued forcing function <j) in Equation ( 5~\ ) is positive, decreasing, 
and sufficiently regular so that the forcing kerne\ , g, satisfies: 

1. g is twice continuously differentiable, 

2. all partial derivatives up through order 2 of all components of g are 
square integrable (over M. d ), 

3- \g\ n , 1 < n < 4 is integrable (over M. d ), and 

4. lim| r |_K JO |0(r)| = 0. 

For a distinct pair of large particles indexed by a and (3, (a 7^ (3), we 
posit the following two-particle kinematic system 

dra (t)\ (L d 9(r a (t) - q)w(dq,dt)\ 

dr(3 (t)J " Ur- 9(r^(t) - q)w(d qj dt)J ' [ ^ } 



The system of Equations ( |5.5|) is presented as two copies of equation (|5.2|) 
and, as such, does not appear to be coupled. Nevertheless, the two equa- 
tions are stochastically coupled through the common white noise integrator, 
w(dq,dt), which represents the medium of small particles in which the two 
large particles move.Q If we adjoin a (random) initial state 



rf>(0)) ~ U 



(5.6) 



35 If r 1— > <^(|r| 2 ) is the distribution function for the velocities of the small particles, then 
(|r| 2 ) is proportional to — 2<p'(|r| 2 ). 
36 The nature of this coupling will be made explicit in Section 
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to Equations (|5.5|) , the corresponding initial- value problem can be shown to 
have a unique solution t i— > r(t), t > 0, which is a Markov process in M M .[^] 
We express the initial-value problem (|5.5|, |5\6|) succinctly by 



where 



(5.7) 



dr(t) = / g(r(t) - q)w(dq, dt),t> 0, 
f(0) = f , 



•»>-•« ' 5 ))-ft$i- <"> 



In Equation ( |5~7D , and below in Equation (|5.10|) , q:= I J , which emphasizes 



that the white noise integrator is the same for both large particles. 

Comment 5.1 

We systematically consider a solvent containing just two large particles. But 
the structure we propose can be extended in a natural way to include any 
finite number of large particles. 



For a single large particle, indexed by a, Equation (|5.2|), together with 
the initial condition Vq, is the same as the stochastic integral equation^] 



r a (t) =r% + J J^ d [9(r a (s) - q)\ w(dq, ds). (5.9) 

For the pair of distinct large particles, the initial value problem, given by 
the system of Equations fl5.7p , is equivalent to the single stochastic integral 
equation 

r(t)=r +[ J (g(r(s)-q))w(dq,ds). (5.10) 

For convenience, we write m a (t) for the stochastic integral involving the 
process r a (-) on the right-hand side of Equation ( |5.9| ); thus0 

m a (t):= [ [ (g(r a (s)-q))w(dq,ds). (5.11) 



JM. d 



37 Kotclenez |s[ |g 

38 Provided the driving noise, w(dq,dt), and the initial data, Tq , are independent. 



39 0ur assumptions imply that the integrals defining the process m a (-) in ( 5.11 ) are 
continuous, square-integrable, martingales whenever the processes r a (-) are adapted (by 
their histories). The letter m signifies martingale. 
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If the process r a (-) in Equation (jcT 1]) is a solution to the evolutionary sys- 
tem Equation ( |5.9|) with random initial condition t*q, we replace r a (s) in 



Equation (^TTj) with r a (s,r^) in which case m a (t) should properly be re- 
placed by m a (t, t*q), etc. Generally, we will suppress this dependence on the 
random initial data. 

Comment 5.2 

It is important to bear in mind that in our model the space-time white 
noise, w(dq,dt), is obtained as a scaling limit of small particle velocities 
acting in a short time scale in a small box on the velocities of the large parti- 
cles^ The significant difference between our model and the traditional one is 
that in the traditional model each large particle is driven by its own indepen- 
dent Brownian motion, whereas in our model the large particles are driven by 
a Brownian medium, which is the same medium for all large particles. The 
marginal distributions associated with Brownian medium^ for each large par- 
ticle is a traditional Brownian motion, but the joint distribution of two or 
more large particles is not Gaussian and, hence, not Brownian. 



To make some computations more specific, we will use a |forcing kernel| of 
the following specific form: 

g £ (r) = <p e (\r\ 2 )r = n e<d e~~r, (5.12) 

where \fe is a correlation length and k £ ^ is a constant that depends on e 
and the physical dimension <i.0 This special form of g is induced by as- 
suming a Maxwell distribution for the velocities of the small particles, which 
seems physically plausible. Note that the kernel g e is of the form given in 
Equation ( |5.4|) (or Equation ( |3.7|) ) whose scalar part, the forcing function, is 

£ i— > e (£) = K e ,de~27, so it is |isotropic~ G We call g £ the Maxwell kernel. In 



the sequel we continue to use the symbol g, reserving g e specifically for the 
Maxwell kernel of Equation ( |5.12| ). 



Return to the unique smooth forward flow induced by the initial value 
problem of Equations (|5.5| , |B~6|) (or the equivalent Equation (|5.10|) ). Since 



40 Kotelenez |27| 

41 These are obtained by convoluti on o f g(-) with w(d(-),dt) and the initial conditions. 
See also our discussion in Appendix |B.2|. 



42 See Commen t 3.1 . 

43 The isotropy is expected, since this formulation is based upon the basic physical laws 
and material assumptions governing the interaction of the small and large particles. Similar 
structures should obtain for any reasonable (unimodal) distribution. 

July 31, 2008 - 13h:54m 



5 ONE- AND TWO-PARTICLE SYSTEMS 37 



the two equations in Equation fl5.5| ) are the same, namely copies of Equa- 
tion ( |5.2|) , they each generate the same flow in M. d distinguished only through 
the initial condition for the particle r initially located at r% : 

t^r a (t):=r(t,r^). (5.13) 

For a pair of large particles, we have the induced flow in M? d of Equa- 
tion (15301) : 

t\->r{t):=r{t,r ). (5.14) 

Whenever we need to emphasize the role of the driving noise w(dq, dt), we 
write r(t, t*q ; w) for r(t, Tq); similarly for the pair-process, we write r(t, f o; w) 
for r(t, fo), where the driving noise w(dq, dt) is the same for both particles. 

Comment 5.3 

The significance of the conclusion above is its connection with the motion of a 
large particle in the interacting particle system of many small and some large 
particles. The forward flow described above for large particles is a stochastic 
limit of the motion of the large particles as the number of small particles 
becomes infinite; the evolution of the positions of the large particles follows 
an Einstein- Smoluchow ski model.f^ 

We proceed to describe the properties of this stochastic limit. In par- 
ticular, we will characterize the joint probability density in IR d for a pair of 
distinct large particles. 

5.2 Properties of the Stochastic Limit 

As emphasized earlier, the motions of each large particle (given by an ap- 
propriate marginal of the solutions to the system in Equation ( p.lOQ Pl) is 
Brownian, provided the initial state is deterministic; however, we will see 
that the joint motion of the pair is not Brownian. 

First we observe that a pair of large particles that are initially distinct will 
almost never coincide. More precisely, an argument due to Dawson shows 



44 The limit is distributional in the sense that, for each a, the distribution associated 
with the motion of the large particle, !•"(•), in the presence of the background of n small 
particles, r"(-), (weakly) approaches those associated with r Q (-) asn-^ oo. (Sec Kote- 
lencz (27|.) 



45 



These are described in Appendix B.2 
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that if] 

E[|r Q (0)-r /3 (0)r 2 ] >0, (5.15) 

then, for any T > 0, 

V\{uj E n : 3t e [0, T] such that r°(t, w) = ^(t, w)}] = : (5.16) 

T/ie probability is zero that two particles, initially distinct, ever coincide in 
a finite time interval. 

Define the (d x d)— matrix C by 



C : 



g{q)g T {q)dq- 



(5.17) 



Now consider the process m a (-), defined in Equation ( |5.11| ). By the 
translation- invariance of the integrals, it follows that, for each a, m a (t) is 
a <i-dimensional Brownian motion with incremental covariance matrix C.Q 
By its construction, C is non-negative definite and symmetric. It will follow, 
from Lemma |5"!2| , that the isotropy of g guarantees that C is positive definite 
(non-degenerate) . 

For a process consisting of a pair of particles, (r Q (-), r^(-)), the 
cross quadratic variation of the processes m a (-) with m^(-), denoted by 
((m a , mr ))(-), is a well-defined (d x d)-matrix valued process. Definitions of 
quadratic variation and cross quadratic variation are given in Appendix E_3. 

The properties of the white noise, w(dq, ds), imply 

((m a ,m^))(t)= ! [ g(r a (s)-q)g T (r^s)-q)dqds, (5.18) 

JO JR d 



which, using shift-invariance, is the same as 



[m^m^)){t)= f I g(( 
Jo Ju d 



r a (s)-rP(s) 



q)g T {-q)dqds. (5.19) 



The essence of the computation leading to Equation ( |5.18| ) together with 
a discussion of some of the consequences that follow are also supplied in 
Appendix | ^. Thus, assuming that the two large particles were initially 



46 Privatc communication. 

47 This follows from a d-dimcnsional version of P aul Levy's T heore m (see Ethier and 
Kurtz |HJ, Chapter 7, Theorem 1.1 or Theorem BJ in Appendix B.3 .) 
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distinct in the sense that assumption ([T ^) holds, the quadratic variation 
((m a , m^))(-) depends only on the process that is the difference between the 
positions of the two large particles, namely r a (-) — r^(-). More important, 
((m a , m^))(-) cannot vanish on any interval, which implies that the processes 
m a (-) and m /3 (-) are correlated on every interval.0 If the particles coincide, 
so a = (3, then ((m a ,m l3 ))(t) = ((m a ))(t) = ((m p ))(t) == tC = tcl, where 
((m)) denotes the quadratic variation of the process m.f^\ 

Equation ( 5.18Q further implies that, in general, the joint motion of two, 



initially distinct, large particles cannot be Gaussian and, a fortiori, cannot 
be Brownian. However, the motion of each large particle in the joint motion 
is Brownian if viewed as a d-dimensional marginal process with determin- 
istic initial conditions.^ This becomes transparent when we examine the 
covariance process for the joint motion. 

nr*h 



If r - [ ^p} is a pair in M? d = W 1 x M. d corresponding to two large 



particles, define D a ^(r) to be the (d x <i)-matrix function on 



i 2d = R d x R d 



D al3 (r):= [ g(r a -q)g T (r 13 -q)dq. (5.20) 

Using shift invariance, this is the same as 

D aP (r)= [ g{{r a -r p )-q)g T (-q)dq] (5.21) 

JR d 



SO 



r P i \ ~ ~ U- r <3 



jy, r = ^ " (5-22) 



and D a ^ is a function only of the difference (r a — r@) E R d . Observe that, 
from Equation ( [5.18| ), we have 



(m a , m p )){t, f „) = / D al3 (r(s, f „)) ds, (5.23) 

Jo 



48 The fact that the matrix function ((m a , m ,3 ))(-) cannot vanish on any interval follows 
from the characterization of D al in Appendix 5.4 and the formula in Equation (5.23)below. 



In particular, the lateral component of D a ' can never vanish. 

49 See Appendix B.3, especially Theorem B.l. We show below that C = cl, for some 



c>0. 

50 See the discussions in Appendices B.2 and B.2 
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provided f(-, f*o) is the pair-process determined by the stochastic evolutionary 
system in Equation (|5.10| ). 



Using Equation (|5^ i!J) we see that if the two particles coincide, r a = r 13 , 

= C, where C is the constant, symmetric, pos- 



we have D aa (r) 



LW 



itive definite covariance matrix associated with a single large particle given 
in Equation ( 5.17 ). The isotropy of g implies that C satisfies: QCQ T -= C 



for all orthogonal transformations Q on M d ; that is, C commutes with 
all orthogonal transformations. It follows from the Theorem (|A.4|) in Ap- 
pendix [A] that C is a constant, positive multiple of the identity matrix; that 
is, C = cl, c > 0.Q Again, from Equation ( |5.22| ), if the two particles are 
distinct, r a ^ r 13 , the matrix D al3 (r) is generally not constant in f, for it 
depends specifically on the difference, r a — r 13 , between the positions of the 
two large particles. 

From its definition, Equation fl5.20p , we see that structurally (D af3 ) T = 
D Pa . We have already observed that when a = p, D aa = D w = = C 



is symmetric. Less obvious is the fact that |isotropy| implies each D 

symmetric when a ^ (3. Thus 

(D a P) T = D^ a = D a/3 



ia/3 



IS 



(5.24) 



To reveal this symmetry, use shift invariance in the definition Equation fl5~20l), 
or any of the equivalent versions that follow, to obtain 



D a %r) = j d g(\(r a ~ r?) - q)g T [ - l -{r a - r") - q) dq. 



,2 V ' V J V 2 

Now jisotropy implies that g{—y) = —g(y), so Equation ( p. 25 ) becomes 



(5.25) 



D 



a/3 1 



g[q-^ 



9 



Structurally, its transpose is 



'D af3 ) T (r) 



g + _( r «_ r /3)|. 



q + ^ r 



q (r 

y 2 V 



dq. 



(5.26) 



r p ))dq. (5.27) 



Finally, by changing the integration variable q in Equation ( 5.27|) to — q we 
recover D al3 (r) in Equation (|5.25|) . 



51 This gives another argument using ^sotropy that C is positive definite; for if c = 
then C = 0, which is impossible. 
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For two particles (r a , r a ) we define the corresponding (2dx2<i)-covariance 
matrix D(r), given in block form byPl 



D(r) 



D aa (r) D a0 (r 
D Pa (r) DW(r] 

C B°*{r) . 
D a ?(r) C 

c\ D a(3 (r] 
D af3 {r) c\ 

Structurally, the matrix D(r) must be symmetric and non-negative definite; 
in addition, each block is symmetric. In Lemma |5^3, we show that whenever 
the two particles are distinct, r a ^ r@ the matrix D{r) is positive definite 
(non-degenerate). Moreover, under our assumptions, D a ^(r) — > or, equiv- 
alently, D(r) — > c\ as \r a — r 13 ] —^ oo. Observe that the diagonal blocks of 
D{r) are constant but the off-diagonal blocks (cross terms) are non-constant 
and non-zero in r; they depend specifically on the difference r a — r 13 . 

Comment 5.4 



For any kernel, under the normalization of Comment [3 J], c = ||C|| = 1, 
the operator norm of C . The Hilbert- Schmidt matrix norm of C is then 
\C\ = v ' C • C = c\fd = y/d. The covariance matrix then has the form 

D(r) = ( D i ( „ D 7>) . (5.29) 

And, in the limit as \fe \ 0, D al3 (r) — > or, equivalently, D(r) — ► 1. 

5.3 The Generator for the Difference Process 

Recall that the pair-process t >— > r(t,r ) is the solution to the stochastic 
evolutionary system in Equation ( |5.10| ); it is a homogeneous Markov process 



such that f(0, fo) = r q. This process is associated with a semigroup of linear 
operators {T t : t > 0} through its transition probability function P: 

(t, x, B) ^ P(t, x, B) := V[r(t, r ) e B\r = x]. (5.30) 



r ,2 



Note that D(r) = f Rd g(r — q)g (r — q) dq, where q 
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That is, (t, x, B) — > P(t, x, B) gives the conditional probability that the pair- 
process r(t, fo) lies in the Borel set B at time t given that its state at time 
t = was r = cc.P| Let / be a bounded, measurable, real-valued function 
on M. 2d . Define T t f by the conditional expectation given through 

(fJ)(x)=E A {f(r(t,r ))}:=E[f(r(t,r ))\r = x] = [ f(y)P(t,x,dy); 

JR 2d 

, (5-31) 
so (T t f)(x) is the expected value of f(r(t, f )) given that r (0, r ) = r = x. 
In particular, if / = 1#, the indicator function of the Borel set B, then 
(T t le)(cc) = P(t,x,B), which means that the transition probability can be 
recovered from the semigroup. 

The (infinitesimal) generator, A, of this semigroup operating on / at x 
is the mean instantaneous rate of change of f(r(t,ro)) at t = given that 
f(O,f- o ) = x:0 

(Af)(x) := ]im±((fj)(x) - /(&)). (5.32) 

The generator, A, of the semigroup {T t : t > 0} is determined explicitly 
through the covariance matrix D(x) of Equation ( |5.28| ). 



Theorem 5.1 (Generator of the semigroup {T t : t > 0}) 

The generator, A, of the semigroup {T t : t > 0} is determined by the second 
order elliptic differential operator, defined for twice continuously differen- 
tiable functions f on M? d that vanish at infinity, by^\ 

2d 



1 / ff 

ov)(*) = = 5 E^.(*)(^^/ 

l,m=l x 



1\ \ / *2 A / /I 



a,/3=lij=l v v 7 7 \ *-""i *-""j 



X 



(5.33) 

2 



53 Since the process is homogeneous with respect to time, the probability does not depend 
on the absolute time, but only on the time interval t; that is, 

(i, x, B) .-» P{t, x, B) := V[r(s + t, f ) € B\f(s, r ) = 4], 

for any t, s > 0. Here, i? denotes an arbitrary Borel subset of R 2d . 

The limit, whenever it exists, is strong in the sense that it is with respect to the norm 

SUp £eR 2d{|/(&)|}. 

55 The domain of A, T> (A), densely contains the twice continuously differentiable func- 
that vanish at infinity, C^(R 2d ,IR). See Ethicr and Kurtz Q. 
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fore 



where D(x) is the covariance matrix defined in Equation ( 5.28 ). Therefi 
we refer to D(x) as the diffusion matrix for the pair-process. 

An outline of the proof of this result is given in Appendix |C|. 

Comment 5.5 

The normalization that we have used to comply with Desideratum ^ is equiva- 
lent to the requirement that C = 1 and, in the limit as e j 0, the infinites simal 
generator A becomes one half the Laplacian. That is, D a ^ — ► 0, as e [ 0. 
In other words, in this limit, each particle experiences a standard Brownian 
motion and these motions are independent. 

We could replace the requirement that the limititng generator be one half 
the Laplacian with the requirement that it be some other constant times the 
Laplacian, say Einstein's diffusion constant, D, times the Laplacian. Such a 
re-normalization will have no effect on our results. 

We now focus on the difference-process (r 2 (-)— r x (-)) in M. d . The generator 
for this process can be extracted from the generator A for the process in 
M. 2d , defined in Equation ( |5.33| ), by means of an orthogonal transformation 
(rotation) in M, 2d followed by a projection. Here are the main steps. Define 
the (2d x 2d) proper orthogonal matrix (rotation) R in block form by 

*=^(-i i) (5 ' 34) 

and the (2d x 2d) projection matrix P by the block form 



The matrix P determines the projection in M? d onto the subspace 



x L 



X 



M? d : x 1 = >, which we will identify with M. d . That is, we identify (<-►) 

the M 2d vector I , J with the M. d vector b. In particular, we extract the 

(normalized) difference -4= (r 2 — r 1 ) from R through the rotation followed by 
the projection: 

— (r 2 -r l )^PRr. (5.36) 

v2 
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Write x = Rx for rotated vectors x in M. 2d . Then the semigroup {T t : t > 0} 
induces the "rotated" semigroup {T t : t > 0} whose "rotated" generator we 
denote by A. We will obtain a generator, A, for the difference-process by 
extracting that part of A associated with the difference-process. It is conve- 
nient, and perhaps more illuminating, to work directly through the original 
transition function, P, instead of the transition function, P, associated with 
the rotated semigroup, {T t : t > 0}. 

For any Borel set B in M. d , consider the cylinder set M. d x B in M? d . Write 
T B ■= R T (R d x B) for the rotated cylinder set. Now P(t,r,T B ) is the 
probability that the two particle system, which started at f at time t — 0, 
lies in T# at time t, so 

P(t, f o, T B ) = V[r(t, f ) e R T (R d x B) |f(0, f ) = f ] 
1 



P 



P 



r 1 ^^ 



72 V^M)-^,^ 



x B] 



(5.37) 



L ^M*,rS) 



* l (t,rl))EB 



Recall that the solutions r 1 ^,^) and r 2 (-,rl) have the same driving noise 
w(dr, dt), which we emphasize by writing r 1 (-, rj; w) and r* 2 (-, r 



Two random variables, say X and F, are said to be equivalent, written 
X ~ Y, whenever they have the same distribution. The next result shows 
that the pair-process has the following restricted translation invar iancef^l 

Lemma 5.1 

For all h in R d 






r 2 {-,rl + h:w\ 



(5.38) 



considered as C([0, oo),~R 2d ) -valued random variables. 



As a consequence of Lemma ( |5.1| ), the transition probability in Equa- 
tion ( |5.37|) depends only upon the (normalized) difference x := 4j(^o — r o)- 
Therefore, suppressing the zero subscript, we can define the following tran- 
sition probability: 



P(t,x,B):=P(t,r,r B ) 



-^=(r 2 -r 1 )=x 



P[t,R 



r, 



(5.39) 



5(i 



Kotelenez \1 
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The transition probability, P, is that part of the transition probability, P 
(or P), associated with the difference-pro cess .f 7 ] We now obtain the gener- 
ator, A, associated with P. It will turn out to be that obtained from A 
through the change-of-variables given by the rotation followed by projection. 
Following our previous notational scheme, if / G CQ(R 2,i ,R), then / G 

C 2 (R M ,R) is just / = / o R. Furthermore, if f( K) \ = f( (? 2 

for all x x ,x 2 G R d , we identify (<->•) the latter with the element f{x 2 ) in 
C 2 (R<*,R). 

Using Definition (|5.20|) , define the (d x d)-matrix valued D function on 

R d by 

tXx) := D" ( R T (°\\ - dMr T (°)) = C - D-0- h' 



*JLs If \ \ tJU j J \ \ / / \ *xj 

(5.40) 
Henceforth, we call D{x) the diffusion matrix for the difference-process. 
From the definition of D in Equation (^ !^) and the comments following 



it, recall that D n (r) = D 22 (r) — C, a constant matrix, and D 2l {r) - 
D 12 (r), where the latter matrix is not constant and depends only on the 
normalized difference x := -4= (r 2 — r 1 ). So D(x) defined in Equation (|5.40|) 
is 

D(x) = (PR)d(r (°) ) (PR) T , (5.41) 



the rotation and projection of D. It is important to note that the |isotro 
of g implies that the diffusion matrix D(x) is isotropic: 

D(x) = Q T D(Qx)Q, (5.42) 

for every x in M. d and every orthogonal transformation Q on M. d . Putting all 
this together yields 

Theorem 5.2 (The Diffusion Matrix for the Difference-Process) 

The transition probability function (t,x,B) i— > P(t,x,B), defined in Equa- 



tion (\5.3£\), generates the difference diffusion process, a Markov-Feller pro- 



cess in R whose generator, A, is given by the second order elliptic operator 



57 



The transition probability P may also be considered a marginal transition probability. 
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16 



(Af)(x) 






d 2 



dxjdxi 



fm 



(5.43) 



where the \diffusion matrix for the difference- process] D(x), is defined in 
Equation t\5.4Q ) (or, equivalently, in Equation (\5.4A ))- 



Observe that the differential operator in Equation Q5.43Q is precisely that 
which would be obtained from the formula in Equation ( |5.33| ) by the change- 
of-variables given through the rotation followed by the projection. Indeed, 
if we change coordinates in ~R 2d by the rotation, x — » x = Rx in Equa- 
tion (|5.33| ) and denote the differential operator A, now expressed in the 
rotated coordinates, by A, we get 



(Af)(x) 



1 2d 

- J2 Dlm(x) 

£,TO=1 

d 



d 2 



dXidx m 
1 



/ (*) 






X 



-x 

x 1 



-?-J 

dxjdxj 



d 2 



dxfdx 2 



f 



(5.44) 



where D(x) := RD(R x)R is the block-diagonal matrix valued function 
of x — ( | 2 ) ) given by 



D(x) 



C + D 12 * 








C-D 




12/ 1 



(5.45) 
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Since D depends only on x 2 , we may take x 1 = O.PI Thus, 



D 12 (Rx) 



^ff;S)-^W> ^ 



A must be the generator of the rotated semigroup {T t : t > 0}. Finally, if we 
apply A to functions of the form / and call the result Af we recover Equa- 
tion (5. t3|), provided the symbol x 2 is replaced by x. We need Lemma ( |5.1|) 



to justify this last step, namely that we indeed have the generator of P. 

Definition 5.1 

Henceforth, x will denote the normalized difference x := -j^(y 2 — r 1 ) and £ 
will denote its magnitude: £ := |je|. We also use the term separation for this 
magnitude. 

Since r -> D 12 {r) = D 12 ( ( V 2 J J = D 12 ( ( 2 ° x J j depends only on a:, 



r l l \ \r — r 

we will usually identify it (<->•) with a function x —> D(x); that is 



.12 / l I ~ x \ \ nl2 / ' 



^^Ivil.JJ^UviJJ (547) 

Earlier we asserted that whenever r 1 7^ r 2 the matrix D(r) was positive 
definite. It is equivalent to show that this assertion is true for the block 
diagonal matrix D(r). The following Lemma, proved in Appendix |E|, suffices. 

Lemma 5.2 

The two (d x d)-blocks C ± D 12 {r) ^> C ± D(x) are each positive definite 
whenever x ^ 0.0 

5.4 The Diffusion Matrix for the Difference Process 



Henceforth we will be concerned exclusively with the |diffusion matrix 



|difference-process| or, succinctly, the [diffusion matrixj ; that is, with the matrix 



function x 1— > D(x) := C — D(x). Earlier we saw that the |isotropy| of the 



forcing kernel| g implied that C had the form C = cl, for some constant 



58 Recall that x 1 = -75 (r 1 + r 2 )) and x 2 = 775 (V 1 - r 2 ) 

59 Of course, if r 1 =r 2 , then C + D 12 = 2C is positive definite while C - D 12 = 0, 
which case D is non-negative definite but not positive definite. 
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> 0. We now use the |isotropy| of g to reveal the deeper structure of D(x). 



We know from Equation Q5.42Q , that it, too, is isotropic; but we go further. 



Using Equation ( |5.20| ) and x of Definition |5.1| we easily see that 

V 2 \ T I v 2 



D(x) = J ^ g[q+ JL_as J g 1 \q - yxl dq. (5.48) 

For convenience, temporarily set x = \/2y and use the representation of g 
in Equation (p.4|), to get 



r>(V22/) = [ (q + y)(q- y) T <P(\q + y\ 2 )<P(\q - y?) dq. (5.49) 



Since we have already shown that D(\/2y) is symmetric, Equation ( |5.49| ) 
reduces to 

D(V2y)= [ qq T <P(\q+y\ 2 )<P(\q-y\ 2 )dq-yy T [ <f)(\q+y\ 2 )<P(\q-y\ 2 ) dq. 

JR d JR d 

(5.50) 
Define a symmetric-matrix-valued function of y by 

A(y) := [ qq T <P(\q + y\ 2 )<P(\q ~ y?) dq (5.51) 

and a scalar- valued function of y by 

6(i/) := f <K\q+ y\ 2 )<P(\q - y\ 2 ) dq, (5.52) 

JR d 

so that 

D^y) = A(y) - b(y)yy T . (5.53) 

A bit of algebra and a change-in-variables yields the following: For any or- 
thogonal transformation Q on M. d we have 

QA(y)Q T = A(Qy) and b(y) = b(Qy); (5.54) 

so each is isotropic in the sense of Definition ( |4.1|) . From Lemma ( |4.1| ) (using 
the alternate form of Part ([3])) we see that A(y) and b(y) must have the 
form 

A(y) = a(\y\ 2 )P(y) + a ± (\y\ 2 )P ± (y) and b(y) = (3{\y\ 2 ) (5.55) 
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for some scalar-valued functions a, a±, [3 : R + — > R; £ i— > a(£ 2 ), a_i_(£ 2 ), /3(£ 2 ). 
Since is non-negative and not identically zero, A(y) is positive definite, so 
we can also conclude that, for every y ^ 0, 

a(\y\ 2 )>0 and a ± (\y\ 2 ) > 0. (5.56) 

At y = 

a(0):=c>0 and a ± (0) = c> 0. (5.57) 

Of course, (3{\y\ 2 ) > for all y, as well. Our regularity assumptions on the 
forcing kernel| g, Hypotheses |5.2| , also imply the limits 



lim a {\y\ 2 ) = 0, lim a ± {\y\ 2 ) = 0, and lim |y| 2 /3(|?/| 2 ) = 0. 

(5.58) 
Combining the results above, we see that .D has the special form 

D(V2y) = a ± (\y\ 2 )P ± (y) + (a(\y\ 2 ) - \y\ 2 /3(\y\ 2 ))P(y). (5.59) 

Putting it all together we see that the [diffusion matrixj, D(x), of Equa- 



tion ( |5.41| ) must have the form given in the following. 



Theorem 5.3 (Structure of the [Diffusion Matrix] ) 



The diffusion matrvj\ D(x) = C — D(x) has the form 

D(x) = a ± (\x\ 2 )P ± (x) + a(|a;| 2 )P(a;), (5.60) 



where a, a± : R + — ► R are given in terms of a, a± and (3 by 
i\ i2\ / i |2 

<t_l(|cc| ) = c- a±\-\x\ 



(5.61) 



We also have 

< cr±(£ 2 ) < c, for f > 0, a ± (0) = 0, lim cr ± (£ 2 ) = c, (5.62) 

£— >oo 

and 

< a(£ 2 ), /or £ > 0, flr(O) = 0, lim a(£ 2 ) = c. (5.63) 



f— >oo 
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Note that <t(|:e| 2 ) and crj_(|a;| 2 ) are the eigenvalues of D(x). Unless they are 
equal, cx(|:r| 2 ) is a simple eigenvalue and <7_i_(|a?| 2 ) has geometric multiplicity 
d—1. We refer to a as the radial eigenvalue and a± as the lateral eigenvalue. 

Definition 5.2 

We say the \diffusion matri^ D is radially dominant whenever its eigenvalues 
satisfy: 

0j_(£ 2 ) < <r(£ 2 ), V£>0. (5.64) 

Radial dominance of D will be shown later to guarantee that the difference- 
process is recurrent in two dimensions. The next result gives a sufficient 
condition for radial dominance in terms of the |forcing function! 0. 



Theorem 5.4 

If, in addition to the Hypothesis [? !1], the scalar-valued forcing function <p 
satisfies the logarithmic convexity condition (ln0)" < ; the \diffusion matritf 
is \radially dominant^] 



This result is proved in in Appendix | 



The general properties for D(x) are all typified by the |5 
for which D £ (x) can be computed explicitly. 



Maxwell 5.1 



For the [? 'axwell kernel g e , given by Equation ^5.12^) , the 
D £ (x), defined in Equation §5. Uty, has the explicit form: 



usion matria 



D e (x) 



C £ - D e (x) 

I 1 



(x) + a e (\x\ 2 )P(x) 
\P ± (x) + c e4 



-2T + 



\X\ 



P(x) 



(5.65) 



where 



c e ,d 



2 «M 



d 



(5.66) 



(50 r 



•'This condition for guarantying Radial dominantcc is a bit strong. See Kotclcnez [E 
for a somewhat weaker condition. The logarithmic convexity condition also guarantees 
that the functions £ \-> a(£ 2 ) and £ i— > aj_(£ 2 ) are strictly decreasing. 
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In Equation ( 5.61. ) 



112 o 

\x\ I s0 ' 



o- £ ±(\x\ ) = c e A 1-e 2= J and ct £ (|cc| 2 ) = c M ( 1 - e * H e 2= 

(5-67) 
T/ie 7 r axwell kerne\ clearly induces a positive-definite, symmetric, radially 



dominant \d,iffusion matrij\, provided x ^ 0. Finally, if we invoke the nor- 



malization of Comment [5 J], we /iave 

c M = 1. (5.68) 

We close this section with the observation that the difference-process, 
z(-), generated in Theorem (|5.2|) can be represented as the unique solution 
to a stochastic Ito differential equation. 

Theorem 5.5 

Let v D denote the unique positive definite symmetric square root of D and 
let /3 be a process in M. d whose components f3j, j = 1,2, ... ,d are i.i.d. one- 
dimensional standard Brownian motions. Then the Markov-Feller difference- 
process of Theorem ( \5.3j ), z(-), generated by the transition probability func- 
tion (t,x,B) \—> P{t,x,B), can be represented as the unique solution to the 
following ltd stochastic initial value problem: 



dz{t) = y/D(z(t))P(dt), (5.69) 

z(0) = x. (5.70) 

Or, equivalently, the ltd stochastic integral equation 

z{t) = x+ f J D{z{s))j3{ds). (5.71) 

Jo 

By virtue of the structure Theorem JO, the latter can be cast in the form 



z(t) = x + ^a ± (\z(sW)P ± (z(s))(3(ds)+ y/v(\z(s)\*)P(z(s))j3(d8). 
'o Jo 

(5.72) 
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6 Separation-Process 

The matrix function x i— > D(x), characterized in the previous section, deter- 
mines the generator of the difference-process, which we have denoted by x(-). 
However, we are most concerned with the magnitude, |cc(-)|, of this process 
or, succinctly, the separation-process . The following argument shows that 
the separation-process is Markovian. 

If y, y G M d and \y\ = \y\ then there is an orthogonal matrix Q such that 
y = Qy. Now set y ■= r 2 - r l , y := r 2 - r£, so r 2 - r l = Qr 2 - Qr l Q . 
Then, by Lemma |TT] and recalling that y2x = r 2 — r 1 , v2xo = Tq — rj, 



x(t,x ) 

r(t,rl,w) -r(t,rl,w) 

(6 1) 

~ x(t,x ) 

Let B be a Borel subset of M and \l/(-) denote the inverse image of the map 
x — ► \x\. Then Equation ( |6.1|) and the preceding considerations imply by 
Dynkin [| that 

P(t,£,B)\ i=]:BQ] :=V{x(t, Xo ) eV(B)\x(0,x o ) = x } (6.2) 

is a transition probability function for a Markov diffusion in IR + . 

To find its generator, denoted by A, we compute the action of the differ- 
ential operator in Equation (5. E||) on functions / of the form /(a?) = </?(|cc|). 



rT . 



As a preliminary, compute the second derivative matrix V^V^/ whose com- 

d 2 

dxidxj 



ponents are given by „ d : ip(\x\). For x ^ 0, this computation yields 



(V*V-/» = ^M(l - =J) + /(M)=J (6.3) 

Here it is useful to use the projections P(x) and P _L (a;), defined in Equa- 
tion Q4.2J ): In terms of these projections, Equation ( |6.3|) is cast in the form 

(V x V T J)(x) = ^^P\x) + v"{\x\)P{x). (6.4) 
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Using the diffusion matrix D(x) in the form of Equation (|5.60 ) and Equa 



tion ( |6.4j ), the partial differential operator of Equation ( |5.43|) applied to 
f(x) := y(|cc|) reduces to the ordinary differential operator^] 

(Aip){\x\) := (Af)(x) 
1 



-yD(x).(VV T f)(x)j (65) 

'^(ixiVCkD + ^NVd*!) 



2V x 



In Section |7 3| we give an independent argument, using Dirichlet forms, that 



this operator is indeed the generator for the separation-process. 



Maxwell! 6.1 

For the special case of the \Maxwell kerne\ the operation of Equation ( |6'.^ ) 
yields 



Wei-VI^T 2 !' 



*(<■ 




(6.6) 



Again, if we invoke the normalization of Comment 3.1, we have 



C £ ,d = 1. (6.7) 

Comment 6.1 

For large £, the operator Aip in Equation (\6.b] ) (or Equation ( \6. 6] )) behaves 

IfhP 

(^)(0^t[^V(0 + ^'(0]- (6-8) 

Up to the constant multiplier, c, the right-hand side is precisely the generator 
of the Bessel process associated with a d— dimensional standard Brownian 
motion /?(•) (See Dynkin j^] Ch. X, (10.87)). In other words, for large £ = 
| a; | the Reparation-process] should behave like the Bessel process of the standard 
Brownian motion /3(-). Therefore, we say that the separation-process with the 
generator in Equation ( \6.3j ) is a process of Bessel-type. 



61 Recall that the inner product of two square matrices, M and N was defined by 

M.AT :=trace{MN T ). 
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For the general Bessel-type process with generator from Equation 
we obtain the following stochastic ordinary differential equation for the 
magnitude process, £ = |cc|: 

d£ = l^a±(e)dt + y/^)/3(dt), 

(6.9) 

m = &, 

where /?(•) is a standard real- valued Brownian motion. Observe that the drift 
term in Equation ( |6.9| ) is proportional to (d — 1) and depends only on the 
lateral eigenvalue while the diffusive (stochastic) term depends only on the 
radial eigenvalue. 

We may now apply the criteria from stochastic analysis of one- 
dimensional diffusions to obtain a proof of the long-time behavior of the 
|separation-procesi| for d > 2 and, therefore, of the attractive and repulsive 



behavior of the separation-procesj for large times (See Proposition 5.9 and 



Theorem 5.13 of Kotelenez E 



Theorem 6.1 (Long-Term Behavior of the |Separation-Process|) 

1. For d = 2, the solution of Equation ) \6.i\ ) is recurrent whenever the 
diffusion matri^ D is \radially dominant^ 



2. For d > 3, the solution of Equation ( \6.f\) is transient. 

As a consequence, if d — 2, the two large particles will attract and repel 
each other infinitely often and, if d > 3, the distance between the two large 
particles will tend to oo almost surely as t — > oo. 

Proof. The following functional was used by Gikhman and Skorokhod ( [ 
or HU, Ch. 4, Section 16), to study the asymptotic behavior of solutions 
of one-dimensional stochastic ordinary differential equations (See also 
Skorokhod [ 7j, Ch. 1.3). We will follow the notation of Ikeda and 
Watanabe [gl|], Ch. VI.3). 



62 



A sufficient condition for radial dominance is that the forcing function 4> satisfies the 



logarithmic convexity condition (ln0)" < 0. (See Theorem 5.4.) 
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*(£,()■= J exp -fid' 1 



a±(z 2 ) dz 
'C L J( ff 2 ) z . 

where ( is an arbitrary point in M. We have to show that 



dy, 



(6.10) 



«&0 



±oo 


as £ - 


^±oo if d = 2, 


and 






s± 


as £ - 


-» ±oo if <i > 3, 



(6.11) 



where — oo < s < s + < oo. Then the conclusion then follows from Theorem 



3.1 in Ikeda and Watanabe Q I|. The arguments that follow rely on the 
properties of the |diffusion matrix! given in Theorem ( |5.3|) . In particular, we 
note thatQ 



lim 



a±[z' 



<?±{z 



■ 1 and < ; J , V2 > 0. (6.12) 

2^00 a\z") a\z 2 ) 

Furthermore, when the [diffusion matrix! D is radially dominant we also have 



< ^4r < 1, V* > 0. 

a(z 2 ) 



1. For any numbers £, p, (, Equation (|6.10|) implies 



s(£,0 = s(p,0+exp 



(d-l)a ± (z 2 ) 



dz 



za\z* 



(6.13) 



s&p). (6.14) 



Thus we may choose appropriate values for p and prove (|6.11| ) for s(£, p) 
instead of s(£, C). It is easy to see that s(£, C) is positive whenever, 
£ > C? and £ I— > s(£, C) is increasing. Furthermore, a straightforward 
computation gives 

s(-^-p) = s(^p). (6.15) 

2. Fix an arbitrary positive number S < |. Then, by the limit in Equa- 
tion ( |6.12D and Equation Q6.13 ) there is a p > such that 

,2\ 



(1-S)< 



ax\z 



<j[z< 



<(! + *), 



(6.16) 



(i.-i 



The ratio in the second inequality of Equation ( 3.12 ) is < 1 in the case of the Maxwell 



kernel. This simplifies the proof of the first assertion for that case. 
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whenever \z\ > p. We can replace the upper bound (1 + 5) in Equa- 
tion (|6.16| ) by 1 whenever the [diffusion matrix| D is radially dominant, 
say when (ln0)" < 0. In this case, if z > p > we have 

(i-D->M >(d . 1)M I. ( 6 .17) 

z za[z z ) z 



3. Suppose d — 2 and £ > p > 0. Then, assuming the convexity condition, 



cxp 



(d-1) / 

Jo 



y v±(z 2 ) 



za(z 2 ) 



dz 



So, 



> exp 



£ 



s(f > C) > s(p, C) + C In ► oo as £ -»• oo. 



(6.18) 



(6.19) 



4. Suppose d = 2 and — £ < — p < 0. Then, using Q6.15|) , the previous 
argument yields 



lim s(-f , -p) = -oo. 
This verifies (|6.11| ) when d = 2 and (ln0)" < 0. 
5. Next, suppose £ > p > and d > 3. Then 



(6.20) 



S (^P) = / 
'p 
r€ 
< / exp 
'p 



(d-1) 



y v±(z 2 ) 
za(z 2 ) 



dz 



r^ r r y dz 

/ exp -(d-l)(l-5) / — 

Jo J z 



dy 



dy 



P 



< 



(d-l)(l-S)-l 

P 

(d-l)(l-5)-l 



(d-l)(l-6)-V 



(6.21) 



since (d — 1)(1 — S) — 1 > whenever d > 3 and < <5 < |. Therefore, 
since £ i— > s(£, p) is increasing we have 



lim s(£,p) = s + < oo, 



(6.22) 



for some s + 6 
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6. In view of (§^^[) we also get 



lim s(— £, — p) = s > — oo, (6.23) 

where s~ = —s + . So (^Tj) is verified for d > 3. 



The statements about the difference of the two large particles are a simple 
consequence of the recurrence and transience properties ■ 

7 Depletion Effect — Stochastic Model 

7.1 van Kampen's Probability Flux 

We generalize van Kampen's notion of probability flux rate to vector processes 
x(-) in M. d in order to characterize regions with a bias in favor of attraction 
or repulsion between particle pairs.0 (See van Kampen f 5j.) Let x ^ be 
a point and v be a unit vector in M. d . Such a pair determines an oriented 
hyperplane in M. d through x with orienting normal v. Let X(x, t) denote the 
probability density for the process x(-) at the point x and time t. We define 
the probability flux rate vector (at (x, t) with the orientation v) by 

j(x, t, v) := ~\x{x, t)D{x)v. (7.1) 

The vector j(x,t,v) is the area density of the inststantaneous probability 
flow rate at time t through a surface element at x oriented by v. (See 
Comment [? 5] for a 1— dimensional motivation.) The flux rate (at (x,t) with 
orientation v), denoted by J(x, v, t), is defined as the divergence of this flux, 
namelyf^l 

J(x, t, v) := V x • j(x, t, v) = V x • ( - -X(x, t)D(x)v J 

V l ' (7.2) 

1 1 ~ 
= — X(x, t)divD(x) »v D(x) V x X(x, t) • v. 



64 The regions in M. d to which we refer in this section are regions in the space of the 
variable x, the normalized vector particle difference. Thus, if one particle is viewed as the 
origin, the other is located at y/2x. 

65 Vj; denotes the spatial gradient in M. d . The symmetry of D is used in the second line 



of Equation (|7.2p . 
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Note that J(x,t,v) is associated with an oriented surface element at x; in 
fact, the functional v h-> J(x,t,v) is linear. 

Henceforth, for simplicity, we assume that, at time t, the probability 
density, y h- > X(y,t), is uniform (constant) in a neighborhood of cc. In this 
case, the flux rate reduces to 

J(x, t, v) = — X(x, t)divD(x) • v, (7.3) 

the v component of the vector — ^X(x,t)divD(x). Thus, if the probability 
density X is assumed locally spatially uniform at a given time, the flux rate 
is essentially a multiple of the divergence of the [diffusion matrixj ; that is, a 



multiple of the vector divD. Roughly speaking, this assumption means that, 
at time t, the probability of finding a second large particle near \/2aj, given 
that the first is at the origin, is locally constant in x. Since our computations 
will be essentially local with respect to a given point x, the assumption is 
reasonable, provided y i— > X(y, t) at time t is reasonably regular. 

In view of the special form of D(x), given in Equation ( |5.60| ), divD(x) 
is radial (parallel to x) ; specifically,!^] 

divD(x) = V x a ± (\x\ 2 ) + P{x){V x a(\x\ 2 ) - V x a ± {\x\ 2 )) 



a X 



2\ 



a ± (\x\ 2 ))divP(x) 



= V x a(\x\ 2 ) + (a(\x\ 2 ) - a ± (\x\ 2 ))divP(x) (7.4) 

= (2a'(\x\ 2 )\x\ + ^l(a(\x\ 2 ) - a ± (\x\ 2 )))u(x). 

For later convenience, define ^ i— > ip(£) on (0, oo) by 

m ■■= j-fie) + d -^ne) - ^(a), (7.5) 

so that divD(x) = ip(\x\)u(x).f^\ Thus, when the probability density 
X(x, t) is assumed to be uniform (constant) in a neighborhood of x at time t, 
the flux rate, J(x, t, v), is completely determined by ^(|a;|). We will be con- 
cerned with the radially oriented flux rate, J(x,t) := J(x,t,u(x)), which we 



66 For a square matrix valued function M(y), divM{y) is defined through the identity 
divM(y) • a = V y • M (y)a. Recall that D is symmetric. 



67 divP(x) = 4— yu(x). Recall the definition of u(x) in Equation (4.1) 
68 For d > 2 all spatial integrals involving ijj will converge at the origin 
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henceforth call the d— dimensional van-Kampen flux rate. J(x,t) takes the 
formf^] 

J(x,t) = K^j 1 ^ 2 ) ~ ^o-(e) - ±cr{e))x{x,t). (7.6) 

Note that, except for the probability density multiplier X(x,t), J(x,t) de- 
pends only on the magnitude £ of x; that is, on the separation. 

In the next sub-section we give a nice geometric interpretation of the 
d— dimensional van Kampen flux rate, J(x,t). 

7.2 Interpretation of van Kampen's Flux — Pill-Box 

In this section suppose d > 2.Q Let S d_1 denote the (unit) sphere in R d 
centered at the origin and let £S d_1 denote the sphere in M. d of radius £ 
centered at the origin. The (dimensionless) surface area of E> d ~ l is denoted 
by Ud-i so the surface area of £§ d_1 is £ d_1 ct;d_i.0 Fix a point z on § d ~ x and 
a cone angle ip, < ip < tx. The spherical cap, Wd-ii'P, z), on S^" 1 consists 
of those points z' on S d ~ x for which z' • z > cos(p^\ The (dimensionless) 
area of this cap, which depends only on the cone angle ip, is denoted by 
^-i(^)-[3 The spherical cap on £§ d_1 subtended by UJd-i(<p>, z) is denoted 
by ^zad-i(<p, z) and its surface area is ^ d ~ 1 ujd~\ (<p)- 

Now fix a point x ^ 0. Recall that u(x) := x/\x\, the associated out- 
ward unit vector (or point on S d_1 ), and set £ := \x\. For some small posi- 
tive A, < A <C 1, consider the spherical shell in M. d of thickness A between 
the concentric spheres £§ rf_1 and (£ + A)§ d_1 . Construct a small "pill-box" 
B(x, ip, A) in M. d by taking that portion of the shell subtended by the spher- 
ical cap w(ip, w(a?))-Q The volume of the pill-box B(x, ip, A) is given by the 
expression 

^(K+Ar-a^U-.i^-'A. (7.7) 



69 Provided the initial probability density X(x,t) is locally spatially constant. 

70 With some obvious modifications, the results of this section hold as well if d = 1. 



71 uj d _ ! = 27r5r(|)- 1 is the surface area of S^ 1 . 

72 Thus, ip is the azimuthal angle in a general spherical coordinate system for M. d whose 
"north pole" is determined by z 

73 Hence, Wd-iM = OJd-i and Wd-i(</') = ^d-2 Jq sin d_2 Odd. 

74 The bounding surfaces of the pill-box are coordinate surfaces in general spherical 
coordinates for M. d . 
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The boundary, dB(x,<p,\), of the pill-box consists of the inner and outer 
ends, with areas 



together with the lateral surface, whose area is 



Ud-2 „.„d_l 



vd-1 c d-l\ °<i< 1 



Refer to the Figure (0) below for a visualization.^ 




^ V ^' 



i^ 



(7.8) 



^i sin^ 1 v? • ((£ + A)^ 1 - £*-*) « ^-2 sin^ 1 ^ • ^ 2 X (7.9) 



Figure 9: The Pill Box B(x, <p, A) 



The net probability flux rate vector out of B(x,ip,X) (at (x,t)) is the 
surface integral 



j(y,t,n(y))dA(y) 



c)B 



--X(x,t)D(y)n(y)dA(y), (7.10) 



r)B 



Geometric terms such as surface area, volume, perimeter, unit outward normal vector, 
etc. must be construed in the d-dimensional context. 
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where n(y) always denotes the unit outward normal vector to dB(x, tp, A) 
at y. Using the form of D(y) in Equation ( |5.60|) , we can compute this flux 
explicitly. Here is the computation:^] 

j(y,t,n(y))dA(y) 

dB 

-\x{x 1 t)D{y)n{y)dA{y) 

dB l 

l -X{xM f a(\y\ 2 )n(y)dA(y)+ f a ± (\y\ 2 )n(y) dA(y) 

^ I ./ends J lateral 

•-A(a;,tj<a; d _2 (£ + Aj <x((£ + A) ) - u; d _ 2 ^_ - £ cr(4 J 



^_ 2 (sin d "V) | €+A ^(C 2 )C d - 2 rfc| 



It UE 



,d-l 



= -Ijf(x, *H-2^f I (* + A) d " V((£ + A) 2 ) - ^V(e 2 ) 

- (d- l)(y €+A (r J .(C 2 )C*- 2 dc) Ma:) 

o<a<i 1 sin d_1 oj ■ , 

« --X (jb, t Vd-2 , , f'^ (flt* a; , 
2 o — l 

(7.11) 

where ip is given in Equation ( [/.Sp )-! 77 ! 

There are two important observations to make regarding the computation 
in Equation ( |7.11| ). First, the contribution to this net flux rate vector through 



the ends depends only upon the radial eigenvalue a while that through the 
lateral surface depends only upon the lateral eigenvalue a±; moreover, the 
lateral contribution is always positive. Second, the net flux rate vector out of 
the pill-box is always radial in the sense that it is a multiple of the outward 
unit radial vector u(x); thus, 

/ J(V, t, n(y)) dA{y) = -\x(x, *)//(£, *, V, A)u(x), (7.12) 

JdB Z 



76 0n the ends P n — 0, Pn = n. On the lateral surface P n — n, Pn = 0. 
77 Rccall that the probability density y i— > X(y, t) is assumed to be constant in a neigh- 
borhood of x for each fixed t. 
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where /i = //(£, t, p, A) is a scalar multiplier, which can be computed from the 
penultimate expression in Equation ( fTTTI) . The Divergence Theorem implies 

that 

-\x{x,t) f divb{y)dV{y) = - l -X{x,t) f D(y)n(y)dA(y). (7.13) 

1 JB l JdB 

Hence, using Equation ( |7.1C| ), 

- l -X{x,t) J divD(y)dV(y) = -^X{x,t)n{\x\,t,cp,X)u{x). (7.14) 

If the scalar multiplier /i in Equation ([T ^) is positive, we can say that 
net probability flux rate vector out of B(x, p, A) is a radial vector pointing 
inward toward the origin. We interpret this to mean that, if the probability 
distribution at time t is locally spatially uniform, there is an instantaneous 
net statistical tendency for points x in B(x, ip, A) to leave the pill-box and 
move radially inward toward the origin; in this sense, for B(x, ip, A), there is a 
net statistical tendency for \x\ to decrease and, hence, pairs of large particles 
have a net statistical tendency to move closer together. In Section [7 3] we we 
give an alternative context in which this interpretation is reenforced. 

Definition 7.1 

If the scalar multiplier \i in Equations (\7.13j\7.Lfi) is positive, we say that the 



region B{x,p,\) is attraction-biased or attractive. Similarly, if /i in Equa- 
tions ( \7.13j , \7.1^\ ) is negative, we say that the region B(x, cp, A) is repulsion- 
biased or repulsive. If ft is zero we say the region B(x,p,X) is neutral. 

By Equations (|7.4} - [7.6|) , we can decide whether B(x, p, A) is repulsive or 
attractive (or neutral) by assessing the algebraic sign of ip in Equation ( |7.5|) . 
Specifically, if A is small, the volume of B(x, p, A) is small, so B(x, tp, A) is 
attractive if "0(1^1) > and repulsive if ^(|je|) < 0. Indeed, evaluating the 
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left side of Equation (|7! ~3|) directly yields^] 



~X(x,t) j divD(y)dV(y) 



sir/" 1 in ( fW +X 



2 v ' ' u_i d-\ 



|*| 



,d-l 



o<a<i l sm tp. ., , ,.. .. , . ._ ., _ 

« --X(a;,0^-2 H ecl^A^MMa:). (7.15) 

Based on this discussion we extend our definitions to more general regions 
in Mr. By a region we here mean a closed and bounded set in M so regular 
that the Divergence Theorem applies. 

Definition 7.2 

A region in M. d , not containing the origin, is attraction-biased or attractive 



whenever the scalar function ip in Equation ( \7.3j) is positive inside the region, 
repulsion-biased or repulsive whenever ip is negative inside the region, or 
neutral whenever ip is zero inside the region. By extension, we say that the 
point x ^ is attractive, repulsive, or neutral whenever ip{\x\) is positive, 
negative, or zero.f^\ 

Comment 7.1 

If the probability density y i— » X(y,t) is not constant in a neighborhood of 
x, but \V x X(x,t)\ is small, our conclusions are not much affected. In this 



case, the flux rate J(x, t) in Equation ( \7.6j ) will have the small additional 
term — \a(\x\ 2 )(u(x) • V x X(x,t)). 

Comment 7.2 

A geometric observation is in order here. The outward radial component of 
the net probability flux rate vector out of B(x,ip,\), defined through Equa- 
tion JJJ7. TQ), is not the same as the flux out of B(x, if, A) of the outward 



78 Observe that 

d- 1 



me- 1 = ^(V- 1 ^ 2 )) - ^jV 1 ^)), 



which shows that the expressions in Equations ( 7.11 ) and ( 7.15 ) are the same and, hence, 
independently verifies the Divergence Theorem for the pill-box. 

79 Since all integrals over regions including the origin converge, we could extend the 
definition to such regions by a straightforward limiting process. Bear in mind that the 
regions to which we refer here are regions in the difference space of the variable x. 
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radial component of y i— * j(y,t,u(y)). The latter depends only upon the 
radial eigenvalue, a, while the former depends on both a and the lateral 
eigenvalue, a±. Indeed, they differ by ^X(x,t) J B yyO"±(|?/| 2 ) dy, which is a 
consequence of the identity 

1 A 1 

(divD(x))»u(x)-div(D(x)u(x)) = ---D(x)»P ± (x) = ^a ± (|a;| 2 ). 

\x\ \x\ 

Thus our van Kampen flux rate depends on the lateral as well as the radial 
effects induced by D. 

Comment 7.3 

Forv = u(x) we see that the algebraic sign of the van Kampen flux J(x,t) is 
opposite that ofijj(\x\). The algebraic sign of the radially oriented van Kam- 
pen flux rate density determines whether a point is attractive, repulsive, or 
neutral. Of course, the same is true for any outward v; that is, for any v 
such that v • u(x) > 0. 

Comment 7.4 

Our notion of attraction/repulsion- bias is instantaneous in the sense that it 
is expected to be valid over a short time interval after the initial time t. For 
if there is an attraction- or repulsion-bias at some time t, we do not expect 
the distribution X(x,t), initially assumed uniform, to remain so for long. 

Comment 7.5 

It is instructive to relate our definition of probability flux rate vector in 
Equation ( \7.I\) with the one-dimensional case. Let B GK denote the interval 
B := [x, x + X], where for definiteness we take < x < x + A. Again, X(x, t) 
denotes the probability density. The probability transfer rate from x to x + A 
(left-to-right) is ^X(x,t)D(x) while the probability transfer rate from x + 
X to x (right-to-left) is — \X{x + X,t)D(x + A). The net probability transfer 
rate between x and x + X is therefore ^X(x,t)D(x) — ^X(x + A, t)D(x + A, t). 
If the the initial probability density is locally spatially uniform, then, 

-X(x,t)D(x) - -X(x + X,t)D(x + A) = -X(x,t)[D(x) - D(x + A)] 



2 v ' 7 A 

-X(x,t)XD f (x). 



(7.16) 
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In particular, if D'(x) > and < A < 1, the net probability transfer rate 
between the endpoints of the small interval, B, is negative; that is, toward 
the origin. 

Now reproduce our pill-box discussion for d — 1; that is, in the vector 



space R 1 . The one-dimensional version of Equation ( \7.1\ ) is j(x,t,v) = 
— ^X(x,t)D(x)v. Here, v is construed as a one- dimensional unit vector 
in R 1 , namely v — ±1. In this context, the unit outward normal vector 
to B at x is n(x) = — 1 and the unit outward normal vector to B at x + A is 
n(x + A) = +1. Therefore, the net probability efflux rate vector out of B is 
(assuming that the initial probability density is locally spatially uniform) 

j(x, t, n(x)) + j(x + A, t, n(x + A)) = 

-X(x, t)D(x)n{x) ) + ( - ~X{x + \ t)D(x + X)n(x + A) 

-±X{x,t)D(x)(-l)) + (-^X{x,t)D(x + \)(+: 

= ~X(x,t)(D(x + X) - D(x)) 

w --X(x,t)XD'(x). (7.17) 

Thus, for d = 1 , the net probability flux rate vector out of B is precisely the 
net probability transfer rate between the end-points of B. If D'(x) > and 
< A < 1, this vector points toward the origin. Note that —^X(x,t)XD'(x) 
is approximately the integral of the van Kampen flux: J B J(y, t, +1) dy. 

Of course, when d = 1 there is no lateral effect to consider. When d > 2 
the lateral effect is significant. 



Maxwell! 7.1 



For the \Maxwell kernefj g e , given by Equation (J5j T^), the divergence of the 



diffusion coefficient matrix D £ (x) in Equation ( \5.4(\ ) is the vector valued 
function 

divD e (x) = %l){\x\)u{x) = ^e- ] ^\x\({2 + d)e- \x\ 2 )u(x). (7.18) 



The van Kampen flux when the \forcing kernel g e , is derived from a 



Maxwellian velocity field for the small particles, is immediate from Equa- 
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tion (FTW: 



J(x, t,v)=(- ±X(x, t)^e- lJ ^\x\ ((2 + d)e - \x\ 2 )\ (u{x) • v) (7.19) 



and, hence, 



1 v( ,,c e4 _Hi, 
-X (x,t — - - 

2 v ' ; 2e s 



J(x, t) = --X(x, i)^f e-^r \x\ ((2 + d)e - \x\ 2 ) (7.20) 



We conclude that, for g E , the region {x E M. d : \x\ < \/{2 + d)e} is 
attraction-biased, and the region {x £ ~R d : \x\ > ^/(2 + d)e} is repulsion- 
biased. The points of neutral bias have measure zero in M d . Note that the 
radius of the attraction-biased region is proportional to the correlation length 
^fe and increases with dimension. We should expect a similar result for 
any unimodular distribution. Figure ([TTjj is a graph of the normalized van 
Kampen flux J as a function of the separation £ = |aj| for the Maxwell kernel. 

Observe that if we invoke the normalization of Comment [3 J| ; so that 
x \— > ^\g £ (x)\ 2 is a probability density on W d , then -4=a/(2 + d)e is the stan- 
dard deviation of the corresponding probability distribution for x. In view 
of the fact that x = -75 (?" 2 — r l ), we see that the separation that deter- 
mines the region of attractive bias is precisely the standard deviation of the 
corresponding distribution for the difference r 2 — r 1 . 

In view of Comment [3^|, this conclusion will not change if another nor- 
malization is used, provided that the probability density x i— ► ^\g £ (x)\ 2 is 
replaced by the appropriate probability density, say x i— > ^j-\g £ (x)\ 2 . 

The curve in Figure ( |7T^ measures the statistical tendency toward cluster- 
ing with separation. Compare it with the curves in Figure (j| B) that measure 
the potential between a large particle and a substrate with separation. Qual- 
itatively they are similar. We expect that any assumed velocity distribution 
of the small particles that is qualitatively similar to the Maxwell distribution 
will tell the same story. 
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0.0 



•d 




0.0 



€ 



Figure 10: The (normalized) van Kampen flux 
separation distance, £ = \x\. 



J(x, t), as a function of the 



Comment 7.6 

Starting from a uniform distribution, at least for short times, we expect that 
the distance between the two particles in attraction-biased regions would tend 
to decrease and in repulsion-biased regions this distance would tend to in- 
crease. In particular, for particles sufficiently close together, the attraction 
bias would be consistent with the depletion phenomena (clustering) observed 
in colloids as described in Section [|. However, the long-time behavior may 
be different. Indeed, a proof of the following result can be found in Skoro- 
hod W\h Ch. 1.3: Assume d = 1 and let z(t,b) be the unique solution of 
Equation \5. 71\ ) with z(0) = 6^0. Then, 



V[\im z(t,b) = 0] = 1. 

t— >oo 



(7.21) 



This means that, on the average, the distance between two particles on the 
real line will eventually tend to zero. In other words, the whole positive 
and negative real lines are attractive regions; any repulsion-biased regions 
will have no long-term effect on the distances between the two particles. We 
believe the difference between long-time behavior and short-time behavior in 
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repulsive regions for dimension d = 1 to be the result of the recurrence of 
one- dimensional Brownian motions. 

Comment 7.7 

In TheoremW^h we showed that the long term behavior of the difference r 2 — r 1 



is recurrent for d = 2, provided the diffusion matrix is \radially dominant , and 



transient for d > 2 (See Kotelenez $£$/)■ Our discussion, based as it is on 
the initial uniformity of the probability distribution X(x,t), is necessarily a 
short-time result. The connection between the short- and long-time behavior 
remains an open and interesting problem, which will be investigated in future 
research. 

7.3 A Tale of Two Fluxes 

In Section p, the generator, A, of the [separation-process! , £ = \x\, was derived. 
If A of Equation ( |6.5| ) is used directly in (0, oo) to compute the 1— dimensional 
van Kampen flux rate for the process £, the result is 



j(x,t) = 1(^-1^) ~ ^-Me))x(x,t), (7.22) 



assuming the initial probability density, X(x,t), to be locally spatially uni- 
form.Q van Kampen's procedure for this is provided in Appendix [F[ This 
flux rate, of course, is different than our <i-dimensional van-Kampen flux 
rate, given in Equation ( |7.6| ). The difference between them is that our flux 
has an additional radial term, — ^-<t(£ 2 ), which is significant. Indeed, for 
the Maxwell kernel, our version predicts an attractive region for every value 
of d > 1, which region increases with d. On the other hand, for the same 
Maxwell kernel, the van Kampen flux of Equation (|7.22|) predicts such an 
attractive region only for d = 1,2, 3, which region decreases with d.f^ This 
begs the questions: Why are the two fluxes so different?^ And, which one 
provides an appropriate means to measure a tendency toward clustering? 

To address the relationship between the two fluxes we consider the Dirich- 
let quadratic form associated with the generator of the Markov- Feller process 



80 For this computation we suppose that the probability density X depends on x through 
£ = \x\. In view of the assumption, this is no restriction. 



81 See Comment \T2. 

82 Of course they do agree when d = 1. 
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given in Equation (^ []])-|_] For suitable scalar- valued functions /, g defined 
in M. d this quadratic form is^] 

£(f, g) := (-Af, g) = - f d \ (£)(x) • V x V T J\x))g{x) dx (7.23) 

The breve (") over the £ here signifies that the underlying equilibrium mea- 
sure for the quadratic form is Lebesgue measure, dx, in ~R d . If / and g are 
sufficiently regular to ensure that the boundary terms associated with the 
Divergence Theorem vanish, the quadratic form in Equation ( |7.23| ) can be 
rewritten as0 

E(f,g) = (-Af,g) = [ l -b{x)V x f\x)*V x g{x)dx 

X ' J I (7.24) 

+ / -divD(x) • 'V x f(x)g(x) dx. 

Jmd 2 



Next we compute the quadratic form, S, of Equation ( |7.23|) or ( |7.24| ) for 



radially dependent functions: f(x) := <^(|cc|), g(x) := -y(|a?|) , as we did in 
Section |B|. If we suppose that the functions (p and 7 are sufficiently regular 
on M + , Equation (|7j24|) becomes^ 



S(f J g) = (-Afr 9 ) = 1 - f b(x)u(x).u(x) i p'(\x\)i(\x\)dx 

H — / divD(x) • u{x)ip'{\x\)^{\x\) dx. 
2 Jm. d 



(7.25) 



Observe that the first term in Equation (7. | or [/ 25) itself determines a 
Dirichlet quadratic form since it is positive definite and, more important, it 
is symmetric with respect to Lebesgue measure, dx, in M. d . The second is 
neither symmetric with respect to Lebesgue measure, dx, in M d nor definite.^ 

83 For a discussion of Dirichlet forms see Fukushima llq] or Ma and Rockner p(J • 

84 We follow the convention using —A in the quadratic form. Also, recall that V X V X / 
denotes the square, symmetric matrix whose components are given by g x g x . f( x )- 

85 Surely this is the case if / and g have compact support. 

86 We suppose that ip, 7 are smooth and have compact support in (0,oo). Recall that, 
for x ^ 0, u(x) = |fy, Vx¥>(|a;|) = ^'(|a;|)it(x), etc. 

87 The second quadratic form, while not symmetric, is not anti-symmetric in our context; 
in fact, it is anti-symmetric if and only if ip(£) = fd _! , for some constant k. 
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We now use the representation Equation ( |5.60| ) of Theorem |S 3| together 
with the formulas of Equations ( |7.4|) and ( [?T5|) to rewrite Equation ( |7.25|) as 



£(f~rg) = (-Af,g) = \f a(\x\ 2 )<p'(\x\W(\x\)dx 

jRd (7.26) 

+ \ [ ^(1*1)^(1x1)7(1x1)^. 

Recall that divD(x) = ijj(\x\)u(x), so the direction of this vector is always 
radial; the scalar-valued function ip is just its outward radial component. 
Since the integrands in Equation (|7.26|) depend only on the radial component 
of x, £ = \x\, we can rewrite Equation ( [7.26 ) in terms of integrals over [0, oo). 
ThusQ 

? 7oo (7.27) 

The bar (~) over the £ indicates that we regard the underlying equilibrium 
measure of the quadratic form restricted to radially dependent functions 
to be the 1-dimensional weighted measure md(d^) on [0, oo) given by the 
d— dimensional volume element 

m d {di) := u d ^ d ~ l d£. (7.28) 



Observe now that the first term in Equation (|7.27| ) is symmetric with respect 
to the measure defined by md(d£) := uJd-iC 4 ^ 1 d£ and the second is not 
symmetric with respect to this measure. For later convenience, write 

1 f°° 
E.(tp, 7 ) := ^ J ^(£ V(£ 2 )7'(0 mM), (7-29) 

1 f°° 
£ns(<P, 7) := 2 J ^(£V (£)7(£) m d (d£), (7.30) 

so E(tp, 7) = £ s ((p, 7) + £ ns ((p, 7). 

Finally, after a bit of standard manipulation, involving another 
integration-by-parts, we obtain 

%,7) = (-Af,~g) = - J™ l(v(ew'\o+^v±(ew\o)i(o™M)i 

(7.31) 



88 Recall that LOd-i is the surface area of the unit sphere §> d 1 . 
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or, equivalently, 

E(f,g) = (-Af,g) = [ (-Atp)(\x\ft(\x\)dx, (7.32) 

where A is the Bessel-type operator defined in Equation ( |6.5|) . 

Using the quadratic form, we can decompose the generator A of Equa- 
tion ( |6.5| ) into two parts, A = A s + A ns , where A s is the generator associated 
with the negative definite, symmetric part of the quadratic form in Equa- 
tion ( [7.271) and A ns is associated with the remaining non-symmetric part. 



Thus, 

(A^)(£) = ^ (^^(£V(0 + 2£a'(£V(0 + *(£V(0 

= ^((e d -v(e 2 ))V(e) + (^v ( ay(o) (7 . 33) 




(a^xo = g (^-^(evco - — *(£V(o - 2e^(e 2 )^(o) 
1 * /d " 1 (e d -Vx(f))^ / (e)-(e d -V(a)V(0 



-^(0^(0- 



(7.34) 



Our (i-dimensional flux corresponds to the second, non-symmetric, part of 
the Dirichlet quadratic form in Equation ( [7.24| or |7.25| ). Indeed, J(x,t) = 
— 7;ijj(\x\)X(x,t) is the d-dimensional van Kampen flux rate given in Equa- 



tion (|7.6p of Section [7 



There is a Markov diffusion in R associated with the first, symmetric, 
part of this form, generated by A s ; moreover, this diffusion has a symmetric 
transition probability density. That is, in a given time interval, the proba- 
bility of moving from x to y is the same as moving from y to x.f^\ For the 
magnitudes, there is also a Markov diffusion in [0, oo) associated with this 
term with a symmetric transition probability density. So, mutatis mutandis 



} See Kotelenez @. 
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the probability of moving from \x\ to \y\ is the same as moving from \y\ 
to \x\. Thus there can be no net flux rate associated with the symmetric 

part of (—Af,g). The remaining non-symmetric part, associated with A ns , 

therefore characterizes the deviation from this neutral bias; it agrees precisely 
with our rf-dimensional flux rate, Equation ( [7.6|) , computed by the pill-box 
argument in Section [7]. Note that A ns also generates a Markov process, but 
it is deterministic (first-order) and, hence, not a diffusion. 

Next, with a bit of manipulation, we produce an alternative decomposi- 
tion of the quadratic form £(tp,j) of Equation (|7.27|) . Namely, 

Zfai) = \[ (^(£ 2 ) - ^^(e))^'(Ol(Om d (dO (7.35) 

1 r°° d / d \ 

"2 1 5^«V K) ) 7KW<e (7 ' 36) 

Write £(</?, 7) = £~i(<p, 7) + £2^, 7), where 

1 /* OO 7 7 1 

£i(^l) ■= g I (^ 2 ) - -^*±(e))v'(Ol(0™ d (da (7-37) 

1 r°° d / d \ 

£2(^1) := -^ J o ^(v(e) T ^(0)l(OrnM)- (7-38) 

In this alternative decomposition neither £x((p,j) nor £2(^,7) is symmetric 
with respect to the equilibrium measure md(d£). However, suppose we for- 
mally replace the equilibrium measure rrid{d^) with Lebesgue measure d£ in 
both £\(<f, 7) and £2^, 7), thereby ignoring the fact that the underlying con- 
text is (i-dimensional. Then £ 2 (<£>, 7) is symmetric with respect to Lebesgue 
measure <i£ and would correspond, in this context, to a symmetric transition 
probability density; that is, it would be neutral in the sense that it would 
contribute no net flux rate. The non-neutral term, £i(<p, 7), now corresponds 
to van Kampen's 1-dimensional flux rate in Equation ( |7.22|) associated with 
the generator A, provided the initial probability density X(C,,t) is locally 
spatially constant. (See Appendix |F|) 

This discussion shows that for the diffusions in M. d , d > 2, which we 
consider, the appropriate equilibrium measure to compute a van Kampen 
flux rate for the [separation-process] , £ = \x\, is the weighted measure md(d£), 
not Lebesgue measure, <i£. The weighting reflects the adjustment needed 
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to account for the fact that the 1-dimensional generator is, in this case, a 
radial reduction from ^-dimensions. If van Kampen's procedure is applied 
directly to the generator A, the underlying equilibrium measure is tacitly 
Lebesgue measure. For d > 2, the resulting flux rate will not then reflect the 
(i-dimensional nature of the problem. In view of the spherical symmetry, this 
is essentially a lateral effect.^ We therefore claim that the (i-dimensional van- 



Kampen flux rate we defined in Equation 7 3 should provide an appropriate 



and faithful measure of the tendency toward clustering. 



Maxwell! 7.2 

In Comment [j Z] we compute our d- dimensional van Kampen flux for the 
specific case of the Maxwell Kernel. As the computation shows, there is a 
small region of attraction bias of magnitude ^/(2 + d)e. The size of this 
region increases with dimension d. On the other hand, In Appendix we 
compute the 1-dimensional van Kampen flux associated with the generator 
A for the Maxwell kernel. For this flux there is a region of attraction bias 
for dimensions d = 1,2,3 only; moreover, the size of the region decreases 
with d. The drift term corresponding to A depends only on the lateral prob- 
ability flux, whereas, from our pill-box argument, the drift term in our d- 
dimensional flux includes the effect of the radial probability flux through the 
ends. Without it this lateral effect is geometrically dominant. For d > 3, 
it completely overwhelms any inward radial bias. We have argued that our 
d- dimensional flux is an appropriate measure of attraction bias since it does 
not include the geometrical, dimension dependent, effects associated with the 
neutral (symmetric) part of the Dirichlet quadratic form. We claim that it 
faithfully represents the net flux rate of x out of a small region in M. d . It is 
not the same as the 1 — dimensional van Kampen flux rate of \x\ for a small 
region of (0, oo) associated with the generator A. 

8 Summary of Results — Further Research 

The depletion effect, which accounts for the tendency toward clustering in 
colloids, even in the absence of electrostatic forces or intermolecular (van der 
Waals) forces, can now be measured with some precision. The very term used, 
namely depletion, is predicated on a notion of the interaction between the 



on 



An exam inati on of the the lateral term (the term containing a±) in the last equality 



of Equation ( 7.11 ) shows that this lateral contribution is always positive. 
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relatively few large particles, which comprise the solute, and the very large 
number of very small particles, which comprise the solvent. We reviewed the 
classical model, based on the interaction of a few large spheres with many 
more small spheres. We also reviewed some related experimental methods 
and data. 

In this work we took an essentially statistical/probalistic approach to 
modeling the depletion effect. The foundation of our model is an interactive 
system of finitely many large particles and infinitely many small particles. 
The dynamic equations for the interaction at the microscopic scale are es- 
sentially Newtonian. A kinetic model for the positions of the large particles 
at the mesoscopic scale is obtained through an appropriate stochastic limit. 
These equations are kinetic in the sense that the scaling limit renders negligi- 
ble the inertial effects of the ensemble of small particles on each large particle; 
that is, in the limit, only the fluctuation effects of the small particles on the 
position of a large particle are retained through a mean field force. The 
kinetic equations retain a small length scale, the correlation length, which 
derives essentially from the variance of the distribution of velocities assumed 
for the small particles. 

We started with four phenomenological desiderata that a good model 
should have and that our model exhibits. Starting with our kinetic model 
we computed the correlation of the joint motion of a pair of large particles. 
Although the motion for each individual particle is Brownian, the joint mo- 
tion of the pair is not. (The joint motion becomes uncorrelated in the limit 
as the correlation length approaches zero or as the large particles move far 
apart.) Specifically, we computed the covariance (diffusion) matrix for the 
joint motion of two large particles. As a consequence of fundamental ma- 
terial restrictions (|isotropy| ) , this matrix was shown to have a very special 



structure. Using this special structure and a classical pill-box argument, 
we then defined and computed a probability flux rate that, starting from a 
uniform distribution, was used to measure the tendency for the separation 
between two particles to decrease or increase. We computed this tendency 
explicitly in the case that the velocity distribution of the small particles at 
the microscopic scale is Maxwellian. We showed that if the two particles 
are sufficiently close together, as measured by the correlation length, there 
is a statistical bias in favor of further decrease in separation --a depletion 
effect. We also showed how our flux rate is an appropriate generalization of 
van Kampen's one-dimensional flux rate. In parallel discussions we explicitly 
computed the infinitessimal generators for the two stochastic processes con- 
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sisting of the (vector) difference and the (scalar) distance between the two 
particles. We then showed the precise relationship between these generators 
and our version of van Kampen's one-dimensional flux rate. 

We emphasize that our results predict an initial statistical tendency to- 
ward clustering from a uniform state. Our procedure can easily be modified 
to account for a non-uniform initial state, but we did not do so here. Of 
course, after the onset of any clustering, the state may no longer be uniform. 

On the basis of our model, the long-term behavior of the separation be- 
tween two large particles was shown to be recurrent in two dimensions and 
transient in higher dimensions. The relationship between our short-term 
results and our long-term results needs further study. 

Our model is kinetic; there are no forces in the Newtonian sense. There is 
an underlying force structure in the original dynamic interactions, but they 
are only implicit in the derived kinetic model. To associate forces directly 
with the clustering effect we derive would require embedding our model in 
some sort of variational structure. This is desirable from a conceptual and 
physical point of view but, we believe, not strictly necessary. 

There is a compelling qualitative comparison between the behavior of our 
flux rate, as computed for the Maxwellian case, and potentials experimentally 
measured between large particles and a substrate or, equivalently, between 
two large particles. Since the latter are in good agreement with the classical 
hard sphere model for the depletion effect, we are encouraged to believe that 
our stochastic approach can faithfully model this effect in colloids. 

Avenues of further investigation that may be fruitful using our approach 
are: 

• Devise experiments to measure directly the effects we compute. A force 
model may be needed. 

• Incorporate interactions between large particles and inertial effects. 

• Compute the tendency to clump from non-uniform initial states. 

• Produce a directed model; that is, give the particles some preferred 
direction (shape). 

• Reproduce the Vrij model computations in ^-dimensions. 

• Investigate the difference between long- and short-term behavior. 
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A Isotropic Functions 



Here we outline a proof of Lemma |J]. We begin with a definition, a state- 
ment of Cauchy's Representation Theorem for isotropic functions, and some 
useful results of linear algebra. 

Definition A.l 

A scalar-valued function 4>(vi, v 2 , . . . , v m ) of m vectors in M. d is isotropic 
whenever 

4>(v u v 2 , ■ ■ ■ , v m ) = <f)(Qv ll Qv 2 , ..., Qv m ), (A.l) 

for every orthogonal transformation Q.Q 

Theorem A.l (Cauchy's Representation Theorem) 

<f)(vi, v 2 , • • • , v m ) is an isotropic scalar-valued function of m vectors in M. d if 

and only if it can be expressed as a scalar-valued function ip of the ^^ — - 
inner products {v t • Vj : i,j — 1,2, ... , m}.P| 

For a transformation B in M d and a scalar /3, the set {y : By = /3y} is 
a subspace of M. d . If this subspace contains a non-zero vector, say d /0, 
it is called a characteristic subspace for B corresponding to (3 and v is a 
characteristic vector (eigenvector) for B corresponding to characteristic value 
(eigenvalue) (3. 

Let t?bea non-zero vector. A transformation Q(v) is a simple reflection 
(through the hyperplane {v } x ) whenever Q(v)v = —v and Q(v)a = a, for 
all a e {v} ± . It is easy to see that Q{v) is an orthogonal, but not proper 
orthogonal, transformation; that is, det(Q(v)) = — 1. Q(v) is also symmetric 
and satisfies 

Q(v)P(v) = P(v)Q(v) = -P(v), 
Q(v)P ± (w) = P ± (v)Q(v) = P ± (v), (A.2) 

Q(v) = -P(v) + P ± (v). 



91 If the underlying invariance group consists of the subgroup of rotations (proper or- 
thogonal transformations), <p is said to be hemitropic. 

92 This is the version stated and proved in Truesdell and Noll En] (1965, §B,II,11). There 
is a corresponding result for hemitropic functions that requires the inclusion of all deter- 
minants. 
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Theorem A. 2 (Commutation Theorem) 

Let A and B be two transformations that commute. Then A leaves each 
characteristic space of B invariant^ 

An immediate corollary of the Commutation Theorem is 

Corollary A. 3 (Reflection Corrolary) If B is a transformation that 
commutes with the simple reflection Q(v), then v is a characteristic vec- 
tor for B. In this case, B has the representation 

B = (u(v) • Bu(v))P(v) + P L {v)BP L (v) = X(v)P(v) + P ± (v)BP ± (v), 

(A.3) 
where X(v) is the eigenvalue associated with v. 



As a consequence of the Reflection Corrolary |7l3] we see that if B commutes 



with Q(v), for every v j^ 0, then every v ^ is a characteristic vector for 
B. The latter is equivalent to B is a multiple of the identity.^ Since simple 
reflections are orthogonal, but not proper orthogonal, we have 

Theorem A. 4 

A transformation commutes with all orthogonal transformations if, and only 
if, it is a multiple of the identity^ 

Comment A.l 

The following result is useful when the group of orthogonal transformations 
is replace by the sub-group of proper orthogonal transformations (rotations). 

Theorem A. 5 

A symmetric transformation commutes with all proper orthogonal transfor- 
mations if, and only if, it is a multiple of the identity. 

To see this, consider a linear transformation B that commutes with all proper 
orthogonal transformations Q. Suppose further that B has at least one char- 
acteristic vector; that is, there is a vector d /0 and a scalar (3 such that 



93 Conversely, if a transformation A leaves each characteristic space of a symmetric 
transformation B invariant, then A and B commute. 

For every t»^fl, there is a number X(v) such that Bv = X(v)v. It follows easily that 
X(v) is constant for all v ^ 0. 

95 In fact, a transformation that commutes with all proper orthogonal transformations 
and a single simple reflection is a multiple of the identity. 
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Bv = j3v. Then, by the Commutation Theorem ^_~J^, Qv is a characteristic 
vector for B corresponding to (3 for every proper orthogonal Q. This implies 
that all non-zero vectors are eigenvectors for B corresponding to the eigen- 
value (3 or, succinctly, B = (31. Finally, if B is symmetric it surely has a 
characteristic vector. 



Proof of Lemma ITT . First note that when the dimension d = 1, Parts flip 



and (JD simply assert that the function is even, while Part (§) asserts that 
it is odd. In all three cases the result clearly holds. Henceforth we suppose 
that d > 2. 

Part flip follows at once from Cauchy's Representation Theorem "?. I with 
m = 1. 

To verify Part (|||), observe that |isotropy| implies that /(0) = 0, so we 
may suppose that r ^0. For an arbitrary vector a, define <p(r, a) by 

(f)(r,a):=f(r)*a. (A.4) 

By its construction, the function a i— > </>(r, a) is linear for each r/0. It is 
easy to show that <fi is an isotropic scalar- valued function of the two vector 
variables r and a. Hence, by Cauchy's Representation Theorem [A.l| , 



/(r»).a = ^(|r| 2 ,|a| 2 ,r.a), (A.5) 

for some scalar-valued function <p of three scalar variables. With r ^0 fixed 
Consider the action of the linear function a i— ► (/>((|r| 2 , |a| 2 ,r • a) on the 
subspace jf}" 1 ". The function a i— ► y9((|r| 2 , |a| 2 , 0) on {r} 1 - must also be 
linear, so 

^((|r| 2 , | - a| 2 ,0) = ^((|r| 2 , |a| 2 ,0) = -y?((|rf , |a| 2 ,0). (A.6) 



Hence, a i— > y9((|r| 2 , |a| 2 , 0) = on {r} 1 - and, by Equation (|A.5|) , a 



f(r) • a = on {V}- 1 . In other words, f(r) G {{^'}" L }" L - In our finite 
dimensional (reflexive) space, {jf}" 1 }" 1 " = {r}, so f(r) G {r}. Equivalently, 
there must be a scalar function r \— > <p(r) such that f(r) = (p(r)r. (For 
definiteness, define </?(0) := 0.) It is easy to show that the |isotropy| of / 



implies the |isotropy| of (p. By Part (lip of the Lemma, there is a scalar 
function (p on M. + such that <p{r) = (p(\r\ 2 ) and, hence, f(r) = (p{\r\ 2 )r. 
Part (|P is proved. 

To verify Part (|3|) we start with two observations. From Equation ( |4.6|) 
we see that the matrix -F(O) commutes with all orthogonal matrices. By 
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Theorem 1 ~A, F(0) = k1, for some scalar constant k. Thus Part (pi) cer- 



tainly holds if r = 0. Henceforth, we suppose r ^ 0. Note also that 
Equation ( |4.6| ) implies that F(r) = F(—r). We will verify the alternate 
statement of Part (|j). 

Set f(r) := F{r)r. It is easy to see that / satisfies Equation ( |4.5|) for all 
orthogonal transformations Q, so it is |isotropic~| By Part (^|) of Lemma ( |4.1|) , 
we must have f{r) = A(|r| 2 )r, for some (real-valued) scalar function A. Thus, 
F{r)r = A(|r| 2 )r and, hence, (A(|r| 2 ); r) is an eigenpair for F(r). The same 
argument shows that (A(|r| 2 ); r) is also an eigenpair for the transpose F (r). 
Indeed, since the vector- valued function F (r)r is also [isotropic] there is a 



scalar function fi such that F T (r)r = fi(\r\ 2 )r, so F(r) has the eigenpair 
(/i(|r| 2 ); r). But F T {r)r»r = F(r)vr, so A = /i. (In fact, all the principal 
invariants of F(r) are isotropic scalar functions of r, so all the eigenvalues of 
F(r) and F (r) are scalar-valued functions of |r| 2 .)Q Surely the subspace 
{r} is invariant under the action of F(r). Let a G {^j^- Then 

F(r)a •r = a» F T (r)r = A(|r| 2 )a • r = 0. (A.7) 

Hence F(r)a e {t*}" 1 ; that is, the subspace {r} 1 - is also invariant under the 
action of F(r). Using the projections P(r) and P (r) defined in Equa- 
tion ( |4.2| ), F(r) must have the decomposition 



F(r) = P(r)F(r)P(r) + P ± (r)F(r)P ± ( y r 
= A(|r| 2 )P(r) + P ± (r)F(r)P ± (r). 



(A.8) 



Define F ± (r) := P ± (r)F(r)P ± (r). Clearly F ± (r) is |isotropic.| Then, 



for every orthogonal Q that leaves the subspace {r} invariant, we have 



T 



F ± (Qr) = F ± (±r) = F ± (r) = QF ± (r)Q . (A.9) 

This follows since Qr = ±r and i r '" L (r') = i r '" L (— r). Therefore, -F'" L (t') 
commutes with every orthogonal matrix Q that leaves {r} invariant. But 
the set of all orthogonal transformations that leave the subspace {r} invariant 
is a subgroup equivalent to the group of all orthogonal transformations on 
{r}- 1 . Again, using Theorem |? !4|, the action of F 1 ^) on {r} 1 - is a multiple 



of the identity on {r*} , which multiplier depends only on r. Thus, 

F ± (r)=ip(r)P ± (r), (A.10) 



96 The arguments of this paragraph and the next do not require that F(r) be symmetric. 
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for some scalar function ip(r). Again, it is easy to show that ip(r) is a scalar- 
valued isotropic function, so ip(r) = Aj_(|r| 2 ) for some scalar-valued function 
Aj_. In summary, 

F(r) = A(|rf)P(r) + X ± (\r\ 2 )P ± (r), (A.ll) 

which is precisely Equation ( [4.10[ ) in the alternate statement of Part (^). 
The proof of Part @ is complete upon identifying the eigenvalue A(|r| 2 ) 
with (7(|f| 2 ) + M 2 ?7(M 2 )) and the eigenvalue Aj_(|r| 2 ) with 7(H 2 ) in Equa- 
tion (p|). 

Finally, for d > 2, F(r) is always symmetric with either one eigenvalue 
of multiplicity d or two distinct eigenvalues with multiplicities 1 and d — l.m 

B Stochastics 

B.l Space-Time White Noise 

Let j3(ds) denote the standard scalar- valued Brownian motion and let 
A, A\, A 2 be Borel subsets of [0, oo) of finite Lebesgue measure. Here, \A\ de- 
notes the Lebesgue measure of A, etc. Then J A f3(ds) is normally distributed 
with mean and variance \A\. Furthermore, if A\ fl A 2 = for two Borel 
subsets of [0, oo) of finite Lebesgue measure, then f A (3{ds) and f A /3(ds) 
are independent. 

Space-time white noise, w(dq,ds), is a straightforward generalization. 
Let B,Bi,B2 be Borel subsets of M. d of finite Lebesgue measure. Then 
j A j B w(dq,ds) is normally distributed with mean and variance \A\ ■ \B\. 
Furthermore, if {B\ x A\) fl (B 2 x A 2 ) = 0, then j A f B w(dq,ds) and 
Ia Ib w (dq,ds) are independent. 

B.2 Marginals for Pair-Paths 

Let r a (-,T*Q) denote the random path in M. d traced out by the position of 

the ath large particle with initial random position r$ G M d , a = 1,2. Then 

fr l {- r l )\ 
the marginal distribution of each component of the pair-path I 2 ) ' 2\ I m 

C([0, oo), ~R 2d ) is the distribution of a Wiener measure on C([0, oo), M. d ) with 
initial support in the point r$ ; that is, on the space of M. d - valued continuous 
functions, /(■), on [0, oo) such that /(0) = Vq, V/. 
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B.3 Quadratic Variation 

Let m(-) and n(-) be two vector-valued continuous martingales on the in- 
terval / = [0, T], T > or / = [0, oo). The quadratic variation of fn(-), 
denoted by ((m)}(-), is the unique, increasing, adapted, matrix- valued pro- 
cess on I such that ((m))(0) = and m(-)m T '(•) — ((m))(-) is a continuous 
martingale.Q Similarly, the cross quadratic variation of m(-) and n(-), de- 
noted by ((m, n))(-), is the unique, adapted, matrix- valued process on I such 
that ((m, n))(0) = and m(-)n T (-) — ((m, n))(-) is a continuous martingale. 
Equivalently, we have ((m, n)) = |(((m + n)) — ((m — n))). 

Some of our conclusions depend on the following two well-known results. 
(See Ikeda and Watanabe (1981) [^] for details.) 

Theorem B.l (Levy-Ito Theorem) 

If ((m))(t) = tB, i 6 I, where B is a positive definite symmetric matrix, 
then m(-) is an adapted Brownian motion (Wiener process) with independent 
increments. If B = 1, then m(-) is a standard Brownian motion. 

Theorem B.2 

If the martingales m(-) and n(-) are uncorrelated on I, then ((m,n))(t) = 

o, t e I. 

Computation of the (Tensor- Valued) Quadratic Variation of Equa- 



tion ( |57L8D 



Start with the the definition of m a (t) given in Equation (|5! 1]). To simplify 
the argument, assume that the underlying position process r a (-) is determin- 
istic rather than stochastic. Next, partition the interval [0,t], t E I, t > 0, 
into non-overlapping small intervals {Ai : i = 1, 2, . . . , M} and partition M. d 
into non-overlapping small d— parallelepipeds {Bj : j = 1,2,... oo}. Let Si 
denote a time in Ai and let q 1 denote a location in Bj. Then, m a (t) is given 
approximately by 



M oo 



™ Q (^EE9( r ^)-^ w(dq,ds). (B.l) 



i=l j=l 



Using the same partition, do the same, mutatis mutandis, for mr{t). Un- 
der our simplifying assumption, the quadratic variation is the covariance 



97 Here, increasing means that the quadratic form t i— » ((m))(t)a • a is increasing for 
every constant vector a. 

July 31, 2008 - 13h:54m 



B STOCHASTICS 



82 



((m a ,m /3 ))(t) = E[(m a (t) - E[m a (t)])(m^(t) - E[m^(t)]) T }. The proper- 
ties of the generalized space-time white noise guarantee that the processes 
I a Ib w {dq., ds) each have mean zero and variance |Aj||.Bj|; moreover, these 
processes are independent; that is, 



E 



E 



At JB 



E 



w(dq, ds) 



Ai JBj 



w(dq, ds) 



w(dq, ds) 



w(dq, ds) 



0. 



lAII-Bfl 

I I' II J \ 



(B.2) 



A,, JB 



0, iii^kj^l. 



Thus, 



(m a ,mP)){t) = E[(m a (t))(m p (t)y} 
jvi oo r /* /* /* p 

J2J29(r a (si)-q j )9 T (r (s k )-q l )E / / w(d qi ds) w(dq 

i,k=lj,l=l VJaJbj JA h JB l 

M oo V / r r 

= Y,Y,9(r a ^)-q 3 )9 T (r P (s l )-q^)E ( / / w(dq,ds) 
i= i j= i VKJaJb, 

M oo 

= EE^W - ^WM - q?)\MBi\ 

i=l j=\ 

M oo r r 

i=i i=i ^ -^ 



ds) 



I l g(r a (s)-q)g T (r^s)-q)dqds. (B.3) 

JO JR d 



This essentially establishes the formula in Equation ([T ~E). The argument 
above should properly be modified to take into account the fact that the 
underlying position processes, r Q (-) = r a (-,uj) and r /3 (-) = r^(-,u), are 
stochastic, not deterministic. In this case, the expectations involved must be 
conditioned on the underlying filtration.^] We omit the details here. 



98 Recall that the underlying probability space is (fl,J 7 ,'P). The underlying filtration is 
an increasing family of a— fields \Tt '■ t € 1} in T determined in a natural way through 
the histories of the processes r a (-) = r a (-,u)) and r^(-) = r"{-,oS). 
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Following the convention for pair-processes established in Section <~1, 
consider the pair-martingale for m(-) and n(-): 

* = (») ■ < B - 4 > 

Then we have the block form for the martingale 

„ A t fmm T mn T \ ,_. _s 

mm. = T T . (B.5) 

ynm 1 nn J 

Hence, the quadratic variation of m must have the block form 

« m »-V<(n,m)> ((n» J" (R6) 

From this and Theorem |3J] we see that the joint motion is Brownian if 
((m))(t) = tB, for some positive definite and symmetric matrix B, and stan- 
dard Brownian if ((m))(t) = tl. Of course, ((m))(t,f ) = J Q D(r(s,r ))ds, 
where D is given in Equation ( |5.28| ) and f (•, f ) is the joint motion of the two 
large particles. The computations in Appendix verify these observations 
explicitly for the [Maxwell kernel . 



C Outline of the Proof of Theorem [571] 



The proof of Theorem |5.1| follows directly from Ito's famous formula. Here 
are the essential steps in the argument. 

Recall that the pair-process r(-,r ) is the solution of the stochastic in- 
tegral equation given in Equation ( |5.10D . Rewrite this equation using the 
notation established in Appendix |B.3| , in particular Equations ( |B.4| ), ( |B.5[ ), 
and fyjp . Thus, 



f(t)=fo + m(t), (C.l) 

where rh(t) is the martingale defined by the second term on the right-hand 
side of Equation (|5!l0D. Let y h- > f(y) be a twice continuously differentiable 
scalar- valued function on IR 2d .0 Then, using Equation ( |5.10D and noting the 



"These functions arc dense in the class of bounded measurable functions on 
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differentiability of ((m))(-), Ito's formula reduces tcf"^ 



f(r(t)) - /(fo) = E jf (J-/)(*00)<M* 



-1 2d „i 



(C.2) 
<9 2 A d 



/ (r(s)) — ((m,,m m ))(s)ds. 



2 , i Jo \dfidf m J ds 

l,m=l 

The first term in Equation (|C^j), containing the first-order derivatives of /, 
is an Ito integral that is a mean zero (square integrable) martingale. Suppose 
the initital condition f is deterministic; call it x, as in Section |5\3| . Compute 
the mathematical expectation of both sides of Equation ( |C.2| ) (conditioned 
on fo = x) and then differentiate with respect to t at t — 0. Noting that 
!«m))(0) =£>(*), we geQ 

±M = \£ A.(*)(^/)(4). (C3) 

The partial differential operator on the right-hand side of Equation (|C.3|) 
is the generator of the pair-process semigroup {T t : t > 0} given in Equa- 
tion Q5.31D by (T t f)(x) = Ex[f(r(t,r ))], where / is now a bounded and 
measurable function on M. 2d and x is any vector in M? d . 



D Diffusion Matrix for the Maxwell Kernel 

Here are the essential details of the computation of the covariance matrix 
function D = D 12 , defined in Equation ( |5.20|) for a, (3 — 1,2, when the un- 



derlying velocity distribution of the small particles is a Maxwell distribution. 
The Maxwell kernel| is g e (r) = K e ^rexp ( — y-), as defined in Equa- 



100 Ikeda and Watanabe pl|(1981, Section 5, Theorem 5.1). The matrix components 
{rh))i m are ((mj, m m )}, I, m= 1,2,..., 2d. 



101 f t m means |/>(t))| t=0 . 
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tion (|5~T^ ). We need to show thatQ 

D?(V2x) = \tJfie)*e->£ (el - |*| 2 P(*)) 

= e ^lM)ie-^ ((l - ^)P(*) + P^x) 
Equivalently, in component form, 



(d.i; 



DIUV2X) := \*U-e)ie-^ { " ***'' * f * J' (D.2) 



d '~' 2 



(e — a^), if fc = I. 



Start with Equation (|5.20| ) in component form: 

D] 2 M {r) = [ geAr 1 - q)ge,i(r 2 - q) dq. (D.3) 



The first observation is that for a = 1, 2 

K-g| 2 - 



# e ^r" - gj = K e ,dlT - Qi) ex Pl 



a / \r a -q\ 2 



2r 



(D.4) 



Hence, using integration-by-parts, 

/ \2 /" / l rl _< ?l\ d 2 i \r 2 -q\ 2 , , 
= -(«e,d£) / ex P( s ) 5 q e x P( „ )«<? 



€) 2 / exp( ^— -^ 



2e <9g fc <9g z Is 

i_„,2 (D.5) 



x 



e e ^^ 2 £ 

1 i ( r l~1k) 



exp(-^ T ^L)dq, if fc^Z, 



+ ^^)exp(-^-^)rfq, iffc = /. 



Next, using shift invariance, we obtain for k ^ I 

p II 2 1 2 i I 1 2 

D l 2 ki(r) = -n 2 d exp( )g fc gzrfq (D.6) 

./rod <££ 



102 Observe that the radial component of Dl 2 is positive if | as | < y/e and negative if 
\x\ > *Je, where v^ is the correlation length. The transverse component of D 12 is always 
positive. 
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and for k = I 



I 1 2 I 2 112 

£?«(*)= -*L/ exp(- |r ~ r ~ g| + I<Z| )(- e + g g)dg. (D.7) 



Now if we use the identity 



exp( 



\ r - q\ + \q\ 



2e 



2e 



I 19 I 1 12 
V \ ! \<l-V'\\ 
eX Pl 7T ) eX P( )» 



we obtain for k ^ I 



Df M {r) 



and for k = I 



-K 2 d 2 exp( 



I "I 2 1 2 



is 



4s 



exp( 



(D.8) 



|l( r l_ r 2)_ g |2 



D lli(r) = -«Lexp( 



I I 2 1 2 



exp( 



iKr 1 -^ 



4s ' ,/th 
A standard integration then yields 

1..2 ^^#_^ | r l_ r 2|2 ^ (r l_ T .2 )(r l, r . 2) 

-e 



QkQi dq 
(D.9) 

i 

-)(-£ + 4) dq. 

(D.10) 



D'Ur) ■■ 



■l K ld^ £ ) 2ex PK- '' 4e ' ) 



l^-r^S / g i (r^-rg) 2 - 



i^d^jaexp^-i— ^ 



if k^l, 
if fc = I. 

(D.ll) 

Finally, using r 1 — r 2 = v^a;, we obtain Equation (|D.2|) or, equivalently, 
Equation ( p.l| ). 



E General Structure of the Diffusion Matrix 

E.l Proof of Lemma \&.'Z\ 

Fix x t^ 0, and let a be an arbitrary constant vector. Compute the quadratic 
form a T (C ± D(x))a. Referring to Equation ( 5.20Q , this can be cast in the 
form 

a T (C±D(x))a= [ (ip 2 {-q) ± ip{- V2x - q)ip{-q)) dq, (E.l) 
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where ip here denotes the function ip 
invariance implies J Rd ip 2 (—q) dq = jj, 
tion ( |E. 1| ) is the same as 

a T (C±D(x))a = - 



= a T g. Observe that translation- 
ip 2 (—y2x — q) dq. Hence, Equa- 



/ {ip 2 {-q)±2i,{-V2x-q)4,{-q) 



+ ij 2 (-V2x-q))dq 



(E.2) 



i J U(-q)±^{-V2x-q)j dq 



So a T (C ± D(x))a > 0, for all a ^ 0. 

Next suppose that for some a ^ 0, a (C ± D{x))a = 0. Then ip(—q) ± 
ip(—\/2x — q) = for almost every qr. That is, the function ip is periodic, with 
period either \[2x or 2\[2x. Since ip = a g must also be integrable over M d , 



»r 



it follows that the function ip = a g is zero almost everywhere in R . Thus, 
a»g(Qq) = for any orthogonal transformation Q and almost every vector 

or, equivalently Q T a»g(q) = 0. 
for almost every q, 



q. Since g is [isotropic] we get a • Qg(q] 

But Q is any orthogonal transformation, so g(q 
which contradicts our assumption. 



E.2 Detailed Structure of the Matrix Function D 



Here we consider the matrix function x h-» D(x) defined in Equation (|5.48|) 
of Section |5^4|. There, in Equations ( |5.50| -|5~ >5|) we saw that 

D(V2y) = A(y) - \y\ 2 (3{y)P{y) 



a ± (\y\ 2 )P±(y) + H\y\ 2 ) - \y\ 2 (3(\y\ 2 ))P(y), 



(E.3) 



for scalar functions £ h-> «(£ 2 ), a_i_(£ 2 ), /3(£ 2 ). We provide explicit formulas for 
these functions in terms of the underlying |forcing function! (p. The following 
definitions will help simplify the process. 



Definition E.l 

For vectors y,q in M. d , define scalars £, £, tp through 

£'=\v\ C-=\q\ £(cosip:=yq 
and set 



(E.4) 



(E.5) 
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It follows from these definitions that 



a > 6 > and a = b iff £ = C 
0<a-b=^-() 2 0<a + b=(Z + () 2 



(E.6) 



The next definition will also prove useful. 

Definition E.2 

For f , C > and < y? < tx, 

$(£, C, ip) := 0(a — b cos y?)0(a + b cos <p) (E.7) 

= <P(\q-y\ 2 )<P(\q + y\ 2 ) (E.8) 

Lemma E.l 

The functions £ i— > a(£ 2 ), «_i_(£ 2 ),/3(£ 2 ) are given through the integrals: 

y (C 2 $(£, C, ¥>) cos 2 y?) C^ 1 sin d " 2 if dip dC, (E.9) 
a ± (a = 2^ 2 ^^ 2 (c 2 $(e,C,^)^Y)c d " 1 sm d - 2 ^^dC, (E.10) 

/3(£ 2 ) = 2a;,_ 2 / / $(£, C,¥>) C -1 sin^ 2 y> # C (E.H) 

Jo Jo 

In paticular, we have 

a(0) = ax(0) = ^ff CW)^ 1 ^C (E.12) 

/•oo 

P(0)=v*-i <f> 2 (( 2 )( d - 1 d( (E.13) 

Jo 



Proof of Theorem |5.4j . Recall that the eigenvalues cr_|_(£ 2 ), cr(£ 2 ) are given 



through Equation ( |5.61|) from which it follows that <t_l(£ 2 ) < <r(£ 2 ) if, and 
only if, a(e) - a±(£ 2 ) < ^(f)- Since /3(£ 2 ) > 0, a ± (£ 2 ) < a(£ 2 ) will 
certainly hold whenever a(£ 2 ) — a_i_(£ 2 ) < 0. Thus, D is radially dominance 
whenever a (£ 2 )— «_l(£ 2 ) < 0. The proof is complete upon proving Lemma |E 
below, which asserts that the logarithmic convexity of <fi implies a(£ 2 ) — 
a ± (e) < 0. ■ 
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Lemma E.2 

U^eW\e) < (<P\e))\ V£ > 0, then < 2 ) - a±(£ 2 ) < O, V{ > O. The 
former condition is equivalent to (hi<f)(z))" < 0, Vz > 0; that is, <fi is loga- 
rithmically concave^^ 

The following computational result will prove useful in the proofs of Lem- 
mas |E.1| and | J2[ We omit its elementary proof. 



Lemma E.3 

Let (y,q) i— > 7(|?/| 2 , \q\ 2 , y q); so 7 is an isotropic scalar-valued function of 
y and q. For fixed y, suppose q h-» |7(|y| 2 , \q\ 2 , y • q)\ is integrable overW 1 . 
Then, for d > 20 

/ l(\y\\\q\\yq)dq = u Jd - 2 / 7 (£ 2 , C 2 , fCccs^C*" 1 sin d "V dtp d(. 

Jw d Jo Jo 

(E.14) 



Proof of Lemma [ILT|. From Equations (|5.52 ) and (|5.55 ), together with 
Equation (E.14 ) of Lemma |E3J we see at once that 



m 2 



where 



<P{\y? + \q? + 2|y||g| cos<p)<t)(\y\ 2 + \q\ 2 - 2\y\\q\ cos</?) dq 



/*00 /»7T 

^d-2 / / ^(CC^C^sin^V^C 

JO JO 

2^ d _ 2 / / $(£, C, </?) C"" 1 sin d " 2 cp dtp d(. 
Jo Jo 

(E.15) 



*(£,C,¥>) = 0(r + C 2 + 2£Ccos^)0(£ 2 + C 2 - 2eCcos^). (E.16) 



The last step follows from the fact that <&(£, £, y?) is even about ip = ir/2. 
Next consider the decomposition 



A(y) = P ± (y)A(y)P ± (y) + P(y)A(y)P(y) 

= a ± (\y\ 2 )P ± (y) + a(\y\ 2 )P(y). 



(E.17) 



103 Rccall that we have always assumed that <\> > and <f>' < 0. 
104 Rccall that u>d-2 denotes the surface area of § rf ~ 2 . 
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This implies immediately that 

a{\y\ 2 ) = trace{P{y)A{y)P{y)) = A{y) • P(y) 

a^\y\ 2 ) = -^— l trace{P\y)A{y)P\y)) = -^jA(y) . P\y). (K18) 

Now use the formulas 

\P{y)q\ 2 = trace(P(y)qq T P(y)) = ( ^ ] = ( 2 cos 2 <p 

V \y\ J 
\P\y)q\ 2 = trace(P\y)qq T P\y)) = \q\ 2 - (^-J = ( 2 sin 2 ^ 

(E.19) 



together with the definition of A(y) in Equation ( |5.51|) and Equation ( E.14|) 



of Lemma ^J73| , to conclude the first two formulas in Lemma E.l| I 



It is easy to see that the inequality cr_i_(£ 2 ) < o - ^ 2 ), for the eigenvalues 
of the |difiusion matrixj D, is equivalent to the inequality a(£ 2 ) — a_i_(£ 2 ) < 



£ 2 /3(£ 2 ). Hence, the result of Lemma [ETT1 indirectly supplies a criterion that 
°j_(£ 2 ) < °"(£ 2 ) be satisfied. As this is too unwieldy, we provide a simple 
sufficient condition that a(£ 2 ) — a±{£, 2 ) < 0. Since /3(£ 2 ) > 0, this will in 
turn guarantee that <t_i_(£ 2 ) < &{£, 2 ) holds; that is, the |dittusion matrix is 



radially dominant, which is the content of Theorem [5__l. 

Proof of Lemma [I 7^. Use the first two formulas of Lemma |E.1| and 
an integration by parts to get 

a(£ 2 )-«±(£ 2 ) 

= 2^ 2 £'' ! |°° ( d+1 m C, v)dCj( cos 2 ip - -^ sin 2 tp\ sin"*" 2 <p dtp 

= -2^-2/* 2 f J°° C d+1 ^{iX^)dC J (^ycos^sin^v) dip. 

(E.20) 

Thus, a(£ 2 ) - a±(£ 2 ) < whenever ^$(£,C,<^) > 0, For £, C > and 
< ip < 7r/2. But 

JU(£,0) = -2^sin^(e,C^)(ln0W)"| 2=c2+c2+ ^ Ccosv (E.21) 

for some r\ G [—1, 1]. Hence, (ln</>)" < guarantees that Tp<&(£, C> V 9 ) > ■ 
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F van Kampen's 1— Dimensional Flux Rate 



van Kampen [[ 5|(Ch.VIII.l) considers one-dimensional diffusions with drift 
a(£) and quadratic variation b(£) > 0, where both coefficients are bounded 
with continuous derivatives. The Fokker-Planck equation for such diffusions 
is 

!*&*) = ~| («(£)*(£,*)) + \^{bW(t,t)), (F.i) 

where X(£, t) is the probability density at the point £ and time t. van Kam- 
pen then defines the probability flux of the one- dimensional diffusion to be 

J(U) ■= a(t)X(t,t) ~ ~-^(mX(U))- (F.2) 

If the probability density, X(C,,t), is locally spatially constant, this reduces 
to 

J{U):=U^)-~(m))x(U)- (F.3) 

From the one- dimensional generator A in Equation (|6.5|) , we see that a(£) = 
|^-ct_l(£ 2 ) and fe(£) = (t(£ 2 ).[^| If we apply Equation QF.3|) above we recover 
the 1— dimensional van-Kampen flux rate of Equation (|7.22|) . 



105 The differential operator in the right-hand-side of the Fokker-Planck Equation (F.f ) 
is the adjoint of the generator A. 
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